This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



Express Mail No. EL6 1 869C. /US 

PRODUCTION OF PHARMACEUTICAL PROTEINS 
IN TRANSGENIC PLASTIDS 
BACKGROUND 

(60/115,987) Research efforts have-been made to synthesize nigh value 
pharmacologically active recombinant proteins in plants. Recombinant proteins such as 
vaccines, monoclonal antibodies, hormones, growth factors, neuropeptides, cytotoxins, serum 
proteins and enzymes have been expressed in nuclear transgenic plants (May et ah, 1996). It has 
been estimated that one tobacco plant should be able to produce more recombinant protein than a 
300-liter fermenter of E. coli. In addition, a tobacco plant produces a million seeds, thereby 
facilitating large-scale production. Tobacco is also an ideal choice because of i ts relative ease of 
genetic manipulation and an impending need to explore alternate uses for this hazardous crop. 

(60/185,987) A primary reason for the high cost of production via ferm entation is the cost 
of carbon source co-substances as well as maintenance of a large fermentation facility. In 
contrast, most estimates of plant production are a thousand-fo ld less expensive than 
fermentation. Tissue specific expression of high value proteins in leaves can enable the use of 
crop plants as renewable resources. Harves ting the cobs, tubers, seeds or fruits for food and feed 
and leaves for value added products should result in further econom y with no additional 
investment. 

(60/185,987) However, one of the major limitations in producing pharmaceutical proteins 
in plants is their low level of foreign protein expression, despite reports of higher level 
expression of enzymes and certain pro teins. May et al. (1998) discuss this problem using the 
following examples. Although plant derived recombinan t hepatitis B surface antigen was as 
effective as a commercial recombinant vaccine, the levels of expression in transgenic tobacco 
were low (0.01% of total soluble protein). Even though Norwalk virus capsid protein expressed 
in potatoes caused oral immunization when consumed as food (edible vaccine), expression levels 
were low (0.3% of total soluble protein). A synthetic gene coding for the human epidermal 
growth factor was expressed only up to 0.001% of total soluble protein in transgenic tobacco. 
Human serum albumin has been expressed only up to 0.02% of the total soluble protein in 
transgenic plants. 

(60/185,987) Therefore, it is important to increase levels of expression of recombinant 
proteins in plants to exploit plant production of pharmacologically important proteins. An 
alternate approach is to express foreign proteins in chloroplasts of higher plant. Foreign genes 
(up to 10,000 copies per cell) have been incorporated into the tobacco chloroplast genome 
resulting in accumulation of recombinant proteins up to 30% of the total cellular protein 
(McBride et al., 1994). 
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(60/185,987) The aforementioned approaches (except chloropiast transformation) are 
limited to eukaryotic gene expression because prokaryotic genes are expressed poorly in the 
nuclear compartment. However, several phar macologically important proteins (such as insulin, 
human serum albumin, antibodies, enzymes etc.) are produced currently in E. coli. Also, several 
bacterial proteins (such as cholera toxin B subunit) are used as oral vaccines against diarrheal 
diseases. Therefore, it is important to develop a plant production system for expression of 
pharmacologically important proteins that are currently produced in prokaryotic systems (such as 
E, coli) via fermentation. 

(60/185,987) Chloroplasts are prokaryotic compartments inside eukar yotic cells. Since 
the transcriptional and translational machinery of the chloropiast is similar to E. coli (Brixey et 
al., 1997), it is possible to express prokaryotic genes at very high levels in plant chloroplasts than 
in the nucleus. In addition, plant cells contain up to 50,000 copies of the circular plastid genome 
(Bendich 1987) which may amplify the foreign gene lik e a "plasmid in the plant cell," thereby 
enabling higher levels of expression . Therefore, chloroplasts are an ideal choice for expression 
of recombinant proteins that are currently expr essed in E. coli (such as insulin, human serum 
albumin, vaccines, antibodies, etc.). We exploited the chloropiast transformation approach to 
Express a pharmacological protein that is of no value to the plant to demonstrate this concept, 
GVGVP gene has been synthesized with a codon preferred for pro karyotic (EG 121) or 

\m_ \ _ ... i — ■ '■ . 

eukaryotic (TG131) expression. Based on transcript levels, chloropiast expression of this 
polymer was a hundred-fold higher than nuclear expression in transg enic plants (Guda et al., 
1999). Recently, we observe d 16.966-fold more tps 1 transcripts in chlortiplast transformants 
than the highly expressing nuclear transgenic plants (Lee et al. 2000, in review). 

(60/263,668) Research on huma n proteins in the past years has revolutionized the use of 
these therapeutically valuable proteins in a variety of c linical situations. Since the demand for 
these proteins is expected to increase considerably in the coming years, it would be wise to 
ensure that in the future they will be availa ble in significantly larger amounts, preferably on a 
cost-effective basis. Because most genes can be expressed in many different systems, it is 
essential to determine which system offers the most advantages for the manufacture of the 
recombinant protein. An ideal expression system would be one that produces a maximum 
amount of safe, biologically active material at a minimum cost. The use of modified mammalian 
cells with recombinant DNA techniques has the advan tage of resulting in products, which are 
closely related to those of natural origin. Howev er, culturing these cells is intricate and can only 
be c arried out on limited scale. 

(60/263,668) The use of m icroorganisms such as bacteria permits manufacture on a larger 
scale, but introduces the disadvantage of producing products, which differ appreciably from the 
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products of natural origin. For example, proteins that are usually glycosylated in humans are not 
glycosylated by bacteria. Furt hermore, human proteins that are expressed at high levels in E.coli 
frequently acquire an unnatural conformati on, accompanied by intracellular precipitation due to 
lack of proper folding and disulfide bridges. Production of recombinant proteins in plants has 
many potential advantages for generating biopharmaceuticals rele vant to clinical medicine. 
These include the following: (i) plant systems are more economical than industrial facilities 
using fermentation systems; (ii) technology is avail able for harvesting and processing plants/ 
plant products on a large scale; (iii) elimination of the purification re quirement when the plant 
tissue containing the recombinant protein is used as a food (edible vaccines); (iv) plants can be 
directed to target proteins into stable, intracellular compartments as chloroplasts, or expressed 
directly in chloroplasts; (v) the amount of recombinant product that can be produced approaches 
industrial-scale levels; and (vi) health risks due to contamination with potential human 
pathogens/toxins are minimized. 

(60/263,668) It has been estimated that one tobacco plant shou ld be able to produce more 
recombinant protein than a 300-liter fermenter of E.coli (Crop Tech, VA). In addition, a tobacco 
plant can produce a million seeds, facilitating large-scale production. Tobacco is also an ideal 
choice because of its relative ease of genetic manipulation and an impending need to explore 
alternate uses for this hazardous crop. However, with the exception of enzymes (e.g. phytase) , 
t evels of foreign proteins produced in nuclear transgenic plants are generally low, mostly less 
than 1% of th e total soluble protein (Kusnadi et al. 1997).(Cholera Toxin Subunit B Filing) 
Protein accumulation levels o f recombinant enzymes, like phytase and xylanase were high in 
nuclear transg enic plants (14% and 4.1% of total soluble tobacco leaf protein respectively). This 
may be because their enzymatic nature made them more resistant to proteolytic degradation. 
(60/263,668) May et al. (1996) discuss this problem using the following examples. Although 
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plant derived recombinant hepatitis B surface antigen was as effective as a commercial 
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recombinant vaccine, the levels of expression in transgenic tobacco were low (0.0066% of total 

1 ■ : 

soluble protein). Even though Norwalk virus capsid protein expressed in potatoes caused oral 

immunization when consumed as food (edible vaccine), expressio n levels were low (0.3% of 

total soluble protein). 

(60/263,668) In particular, expression of hu man proteins in nuclear transgenic plants has 

been disappointingly low: e.g. human Interferon-p 0.000017% of fresh weight, human serum 

albumin 0.02% and erythropoietin 0.0026% of total soluble protein (see Table 1 in Kusnadi et al. 

1997). A sy nthetic gene coding for the human epidermal growth factor was expressed only up to 

0.001% of total soluble protein in transgenic tobacco (May et al. 1996). The cost of producing 

recombinant proteins in alfalfa leaves was qstimated to be 12-fold lower th an in potato tubers 
, . ; 1 ' — r — 
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and comparable with seeds (Kusnadi et al. 1997). However, tobacco leaves are much larger and 
have much higher biomass than alfalfa, Planet Biotechnology has recently estimated that at 50 
mg/liter of mammalian cell culture or transgenic goat's milk or 50mg/kg of tobacco leaf 
expression, the cost of purified IgA will be $10,000, 1000 and 50/g, respectively (Daniell et al. 
2000). The cost of production of recombinant proteins will be 50-fold lower than that of E.coli 
fermentation (with 20% expression levels in E.coli) (Kusnadi et al. 1997). A decrease in insulin 
expression from 20% to 5% of biomass doubled the cost of production in E.coli (Petridis et al. 
1995). Expression level less than 1% of total soluble protein in plants has been found to be not 
commercially feasible (Kusnadi et al. 1997). Therefore, it is important to increase levels of 
expression of recombinant proteins in plants to exploit plant production of pharmacologically 
^important proteins. 

(60,263,668) An alternate ap proach is to express foreign proteins in chloroplasts of 
higher plants. We have recently integrated foreign gen es (up to 10,000 copies per cell) into the 
tobacco chloroplast genome resulting in accumulation of recombinant proteins up to 46% of the 
total soluble protein (De Cosa et al. 2001). Chloroplast transformation utilizes two flanking 
sequences that, through homologous reco mbination, insert foreign DNA into the spacer region 
between the functional genes of the chloroplast gen ome, thereby targeting the foreign genes to a 
precise location. This eliminates the position effect and gene silencing frequently observed in 
nuclear transgenic plants. Chloroplast genetic engineering is an environmentally friendly 
approach, minimizing concerns of out-cross of introduced traits via pollen to weeds or other 
crops (Bock and Hagemann 2000, Heifetz 2000). Also, the concer ns of insects developing 
resistance to biopesticides are minimized by hyper-expression of sing le insecticidal proteins 
(high dosage) or expression of different types of insecticides in a single transformation event 
(gene pyramiding). Concerns of insecticidal proteins on non-target insects are minimized by 
lack of expression in transgenic pollen (De Cosa et al. 2001). 

(60/263,668) Importantly , a significant advantage in the production of pharmaceutical 
proteins in chloroplasts is their ability to process eukaryotic proteins, including folding and 
.formation of disulfide bridges (Drescher et al. 1998). Chaperonin proteins are present in 

/ : ~ 

chloroplasts (Roy, 1989; Vierlin g, 1991) that function in folding and assembly of 
prokaryotic/eukaryotic proteins. Also, proteins are activated by disulfide bond oxido/reduction 
/^cycles using the chloroplast thioredoxin system (Reulland and M iginiac-Maslow, 1999) or 
chloroplast protein disulfide isomerase (Kim and Mayfield, 1997). Accumulation of fully 
assembled, disulfide bonded form of human somatotropin via chloroplast transformation (Staub 
et al. 2000), oligomeric form of CTB (Henriques and Daniell, 2000) and the assembly of 
heavy/light chains of humanized Guy's 13 antibody in transgenic chloroplasts (Panchal et al. 
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2000) provide strong evidence for successful processing of pharmaceutical proteins inside 

chloroplasts. Such folding a nd assembly should eliminate the need for highly expensive in vitro 

processing of pharmaceutical p roteins. For example, 60% of the total operating cost in the 

production of human insulin is associated with in vitro processing (formati on of disulfide bridges 

and cleavage of methionine, Petridis et al. 1995). 

(60/263,668) Another major cost of insulin production is purification. Chromatography 

accounts for 30% of operating expenses and 70 % of equipment in production of insulin (Petridis 

et al. 1995). Therefore, new approaches are needed to minimize or eliminate chromatography in 

insulin production. One such approach is the use of GVGVP a s a fusion protein to facilitate 

single step purification without t he use of chromatography. GVGVP is a Protein Based Polymer 

, ~~ ■ 

(PBP) made from synthetic genes. At lower temper atures this polymer exists as more extended 

molecules. Upon raising the temperature abo ve the transition range, polymer hydrophobically 

folds into dynamic stru ctures called p-spirals that further aggregate by hydrophobic association 

to form twisted filaments (Urry, 1991; Urry et al., 1994). Inverse temperature transition offers 

several advan tages. It facilitates scale up of purification from grams to kilograms. Milder 

'purification con dition requires only a modest change in temperature and ionic strength. This 

should also facilitate higher recovery, faster puri fication and high volume processing. Protein 

purification is generally the slow step (bottleneck) in pharmaceutical product development . 

Through exploitation of this reversible inverse temperature transition property, simple and 

inexpensive extraction and purification may be perfor med. The temperature at which the 

aggregat ion takes place can be manipulated by engineering biopolymers containing varying 

numbers of repeats and changing salt concentration in solution (McPherson et al., 1996). 

Chloroplast medi ated expression of insulin-polymer fusion protein should eliminate the need for 

the expensive fermentation process as well as reagents needed for recombinant protein 

purification and downstream processing . 

(60/263,668) Oral delivery of insulin is yet another powerful approach that can eliminate 

up to 97% of the production cost of insulin (Petridis et al. 1995). For example, Sun et al. (1994) 

ftave shown that feeding a small dose of antigens conjugated to the receptor binding non-toxic B 

lubunit moiety of the cholera toxin (CTB) suppressed systemic T cell-mediated inflammatory 

reactions in animals. Oral administration of a myelin antigen conjugated to CTB has been 

shown to protect animals against encephalomyelitis, even when given after disease induction 

(Sun et al. 1996). Bergerot et al. (1997) reported that feeding small amounts of human insulin 

conjugated to CTB suppresse d beta cell destruction and clinical diabetes in adult non-obese 

diabetic (NOD) mice. The protective effect could be transferred by T cells from CTB -insulin 

V ~ " 

treated animals and was associated with reduced insulitis. These results demonstrate that 
T 
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protection against autoimmune di abetes can indeed be achieved by feeding small amounts of a 
pancreas islet cell auto antigen linked t o CTB (Bergerot et al. 1997). Conjugation with CTB 
facilitates antigen delivery and presentation to the Gut Associated Lymphoid Tissues (GALT) 
due to its affinity for the cell surface receptor GMi-ganglioside located on GALT cells, for 
increased uptake a nd immunologic recognition (Arakawa et al. 1998). Transgenic potato tubers 
ekpresse d up to 0.1% CTB-insulin fusion protein of total soluble protein, which retained GM t - 
ganglioside binding Affinity and native autogenicity for both CTB and insulin. NOD mice fed 
with transgenic potato tubers containing microgram quantities of CTB-insulin fusion protein 
showed a substantial reduction in insulitis and a delay in the progression of diabetes (Arkawa et 
al. i998). Ho wever, for commercial exploitation, the levels of expression should be increased in 
transgenic plants. Therefore, we propose here expression of CTB-insulin fusion in transgenic 
chloroplasts of nicot ine free edible tobacco to increase levels of expression adequate for animal 
testing. 

(60/263,668) Taken together, lo w levels of expression of human proteins in nuclear 

transgenic plants, and difficulty in fol ding, assembly/processing of human proteins in E.coli 

s hould make chloroplasts an alternate compartment for expression of these proteins. Production 

of human proteins in transge nic chloroplasts should also dramatically lower the production cost. 

Large-scale production of insulin in tobacco in conju nction with an oral delivery system can be a 

powerful approa ch to provide treatment to diabetes patients at an affordable cost and provide 

tobacco farmers alternate uses for this hazardous crop. Therefore, it is first advantageous to use 

poly(GVGVP) as a fusion protein to enabl e- hyper-expression of insulin and accomplish rapid 

one step purification of the f usion peptide utilizing the inverse temperature transition properties 

of this polymer. It is further advantageous to develop insulin-CTB fusion protein for oral 

delivery in nicotine free edible tobacco (LAMP 605). 

SUMMARY OF INVENTION (60/263,668) 

This inve ntion synthesizes high value pharmaceutical" proteins in transgenic plants by 
t — • 
chloroplast expression for pharmaceutical protein production. Chloroplasts are suitable for this 

purpose because of their ability to process eukaryotic proteins, including folding and formation 

of disulfide bridges, thereby eliminating the need for expensive post-purification processing. 

Tobacco is an ideal choice for this purpose because of its large biomass, ease of scale-up (million 

seeds per plant) and genetic manipulation. We use poly(GVGVP) as a fusion protein to enable 

- , ^ — . . ■ 

hyper-expression of insulin and accomplish rapid one step p urification of fusion peptides 
utilizing the inverse temperature transition properties of this polymer. We also use insulin-CTB 
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fusion protein in chloroplasts of nicotine free edible tobacco (LAMD 605) for oral delivery to 
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NOD mice. 
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(Cholera Toxin Subunit B filing) This invention includes expression of native cholera 
toxin B subunit gene as oligomers in transgenic tobacco chloroplasts which may be utilized in 
conne ction with large-scale production of purified CTB, as well as an edible vaccine if expressed 
in an edible plant or as a transmucosal carrier of peptides to which it is fused to either enhance 
mucosal immunity or to induce oral tolerance of the products of these peptides . 

BRIEF DESCRIPTION OF DRAWINGS (60/185,987 & 60/263,668} 
Figure 1 shows a nalysis of Biopolymer-Proinsulin Fusion Protein Expression. 
Figure 2 shows confirmation of chloroplast integration by PCR of polymer-proinsulin 

fusion gene. 
~-> 

Figure 3 shows CTB gen e expression in E. Coli and chloroplast integration. 

Fig. 4 shows graphs of Cry2A protein conce ntration determined by ELISA in transgenic 

leaves. 

Fig. 5 is an inmunogold labeled electron microscopy of a mature transgenic leaf. 
Fig. 6 contains photographs of leaves infected with 10 p.1 of 8xl0 5 , 8xl0 4 , 8xl0 3 and 
8xl0 2 cells of P. syringae five days after inoculation. 

7 ' Fig. 7 is a graph of total plant protein mixed with 5 \A of mid-log phase bacteria from 

/Overnig ht culture, incubated for two hours at 25°C at 125 rpm and grown in LB broth overnight. 

Fig. 8A is a graph of CTB ELI SA quantification shown as a percentage of total soluble 

•rr- — ■ *TZ * " 

plant protein. 

. Fig. 8B is a graph of CTB-GM1 Ganglioside binding ELISA assays. 

Fig. 9 is a 12% reducing PAGE using Chemiluminescent detection of CTB oligomer with 
rabbit anti-cholera serum (1°) and AP labeled mouse anti-rabbit IgG (2°) antibodies. 

Figs. 10A and B show reducin g gels of expression and assembly of disulfide bonded 
Guy's 13 monoclonal antibody . 

Fig. 10C shows a non-reducing gel of expression and assembly of disulfide bonded 
Guy's 13 monoclonal antibody . 

Figs. llX - F show photographs- comparin g betaine aldehyde and spectinomycin 
selection. 

Figs. 12A and B show biopolymer-proinsulin fusion protein expression in E. Coli. 

Fig. 13A shows western blots of biopolymer-proinsulin fusion protein after single step 
purificat ion in E. Coli. 

Fig. 13B shows western blots of another biopolymer -proinsulin fusion protein after single 
step purification in E. Coli. 
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Fig. 13C shows western blots of yet another biopolymer-proinsulin fusion protein after 
single step p urification in transgenic chloroplasts. 

Fig. 14 shows biopolym er-proinsulin fusion gene integration into the chloroplast genome 
confirmed by Southern blot analysis. 

Figure 15 is a graphical repres entation of total protein versus leafage in transgenic tobacco 
plants. 

Figure 16 is an electron micrograph showing Cry2Aa2 crystals in a transgenic tobacco leaf . 
Figure 17 is a photograph of leaves infected with P. syringae 5 days after inoculation. 
• Figure 18 is a graph showing the results of an in vitro assay of P. aeruginosa. 
Figure 19 is two graphs showing o ligomeric CTB expression levels as Total Soluble Protein. 
Figur e 20 is a Western Blot Analysis of transgenic chloroplast expressed CTB and commercially 

- available purified CTB antigen . 
Figure 21 is a Western Blot Analysis of heavy and light chains of Guy's 13 monoclonal antibody 

from plant chloroplasts. 
Figure 22 is a W estern Blot of transgenic potato tubers, cv Desiree expressing HSA. 
Figure 23 is a frequency histogram including percentage Kennebec and Desiree transgenci 

plants expressing different HAS levels. 

Figure 24 is a Western Blot of HAS Expression in E. coli. 
+ ~ — - — : : 

Figure 23' is a Western Blot of HAS expression in transgenic chloroplasts. 

Figure 26 shows the PCR analysis of transformants to determine integration of HSA gene into 
the chloroplast genome. 

Fig. 27 pLD -LH-CTB vector and PCR analysis of control and chloroplast transformants. 
A The perpendicular dotted line shows the vector sequences that are homologous to native 
chloro plast DNA, resulting in homologous recombination and site specific integration of the 
gene cassette into t he chloroplast genome. Primer landing sites are also shown. B. PCR 
analysis: 0.8% agarose ge l of PCR products using total plant DNA as template. 1 kb ladder 
'(lane l);Untr ansformed plant (lane 2); PCR products with DNA template from transgenic lines 1 
-10 (lanes 3 -12). 

run-; i 

Fig. 28 Western blot analysis of CTB expression in E.coli and chloroplasts. Blots were 
detected using rabbit anti-cholera serum as primary antibody and alkaline phosphatase labeled 
mouse anti -rabbit IgG as secondary antibody. A. E.coli protein analysis: Purified bacterial 
CTB, boiled (lane l); Unboiled 24 h and 48 h transformed (lanes 2 & 4) and untransformed 
(lanes 3 & 5) E. coli cel l extracts. Plant protein analysis: B. Color Development detection: 
Boiled, untransformed protein (lane l); Boiled, purified CTB antigen (lane 2): Boiled, protein 
from 4 different transgenic lines (lanes 3 - 6). C. Chemiluminescent detection: Plant protein- 
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Untransformed, unboiled (lane 1); Untransformed, boiled (lane 2); Transgenic lines 3 & 7, boiled 
(lanes 3 & 5), Transgenic line 3, unboiled (lane 4); Purified CTB antigen boiled (lane 6), 
unboiled (lane 7); Marker (lane 8) . 

Fi g. 29 S outhern blot analysis of T 0 and T, plants. A. Untransformed and transformed 
chloroplast genome: Transformed and untransformed plant DNA was digested with BgHI and 
hybri dized with the 0.81 kb probe that contained the chloroplast flanking sequences used for 
homologous recombination. Southern Blot results of To lines (B) Untransformed plant DNA 
(lane 1); Transformed lines DNA (lanes 2 - 4) and T t lines (C) Transformed pla nt DNA (lanes 1 - 
4) and Untransformed plant DNA (lane 5). 

Fig. 30 A. P lant phenotypes; 1: Confirmed transgenic line 7; 2: Untransformed plant B. 
10-day-old seedlings of T, transformed (1, 2 & 3) and untransformed plan t (4) plated on 
500mg/L spectinomycin selection medium. 

Fi g. 31 A. CTB ELISA quantification: Absorbance of CTB-antibody complex in known 
concentrations of total soluble plant protein was compared to absorbance of known concentration 
of bacterial CTB-antibody complex and the amount of CTB was expressed a s a percentage of the 
total soluble plant protein. Total soluble plant protein from young, mature a nd old leaves of 
transgenic lines 3 and 7 was quantified B. CTBGM .Ganglioside binding ELISA assays: Plates 
coated first with GM, gangliosides and BSA respectively, were plated wi th total soluble plant 
protein from lines 3 and 7, untransformed plant total soluble protein and purified bacterial CTB 
and the absorbance of the GM,ganglioside-CTB-antibody complex in eac h case was measured- 
Figure 32 shows the cloning of the psbA 5' untranslated region (5'UTR) from the 

chloroplast genome). 

Figure 33 shows the SOEing of the 5'UTR to the CTB-human proinsulin sequence. 
' Figure 34 shows a comparison of the DN A sequences of native human proinsulin and 

plastid mod ified proinsulin. 

Figure 35 show s recursive PCR to synthesize the chloroplast modified proinsulin (Ptpris). 
. 7 Figure 36 shows SOEing of the 5'UTR, CTB and plastid modified proinsulin, which 
results in the fusion of all three sequences denoted as 5CPTP. 

Figure 37 shows the PC R products to confirm construct integration into the chloroplast 
genome using two pr imers, 3P and 3M. 

Figure 3 8 shows t he Western B lot of tobacco protein extracts showing expression of 
HSA via the chloroplast genome. 

Figure 39 shows Southern Blot of HSA chloroplast transgenic plants. Untransformed 
tobacco DNA vs. transgenic tobacco DNA digested with EcoRI. 
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Figure 40 shows Northern Blot of HSA chloroplast transgenic plants using HSA probe 
(1.8kb) . 

Figure 41 shows ELISA of HSA transgenic plants . 

Figure 42A shows 1GF-I native sequence coding for the mature protein. 

Figure 42B shows IGF-I optimized sequence according to chloroplast preferred codon 

usage. 

Figure 42C shows IGF-I synthetic gene after recursive PCR. 

DETAILED DESCRIPTION 

(60/,263,668) A remarkable feature of chloroplast genetic enginee ring is the observation 
of exceptionally large accumulation of foreign proteins in transgenic plan ts. This can be as much 
as 46% of CRY protein in total soluble protein, even in bleached old leaves (DeCosa et al. 2001) . 
Stable expression of a pharmaceutical protein in chlorop lasts was first reported for GVGVP, a 
protein based polymer with varied medical applications (such as the prevention of post-surgical 
adhesions and scars, wound coverings, artificial pericardia, tissue reconstruction and 
programmed drug delivery) (Guda et al. 2000). Subsequently, expression of the human 
■ somatotropin via the tobacco chloroplast genome (Staub et al. 2000) to high levels (7% of total 
soluble protein) was observed. The following investigations that are in progress illustrate the 
power of this t echnology to express small peptides, entire operons, vaccines that require 
oligomeric proteins with stable disulfide bridges and monoclonals that require assembly of 
heavy/light chains via chaperonins. It is essential to develop a selection system free of antibiotic 
■•' resistant genes for the edible insulin approach to be successful. One such marker free c hloroplast 
transformation system has been accomplished(Daniell et al. 2000). Experiments are in progress 
to develop chloroplast transformation of edible leaves (alfalfa and lettuce) for the practical 
applications of this approach . 

(60/185,987) In our research, we use insulin as a model protein to demonstrate its 
production as a value added trait in transgenic tobacco. Most importantly, a significan t 
advantage in the production of pharmaceutical proteins in chloroplasts is their ability to proces s 
eukaryotic protein, including folding and formation of disulfide bridges (Dreshcher et al., 1998 ). 
Chaperonin proteins are present in chloroplasts (Verling 1991; Roy 1989) that function i n 
folding and assembly of prokaryotic/eukaryotic proteins. Also, proteins are activated by 
disulfide bond oxido/reduction cycles using the chloroplast inicredoxin system (Reulland and 
Miginiac- Maslow, 1999) or chloroplast protein disulfide isomerase (Kim and Mayfield, 1997) . 
* Accumulation of fully assembled, disulfide bonded form of antibody inside chloroplasts, eve n 
though plastics were not transformed (During et al. 1990), provides strong evidence for 
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(Panchai et al. 2000, in review). Such folding and assembly eliminates the need for post - 
purification processing of pharmaceutical proteins. Chloroplasts may also be isolated from crude 
homogenates by ce ntrifugation (1500 X g). This fraction is free of other cellular proteins. 

rT7 rrr^r — -f-, f . . . • • ™ " T ^ rrn ' " ~ rrr "- ! 1 • 

Isolated chloroplasts are burst open by osmotic shock to releas e foreign proteins that are 
compartmentalized in this organelle along with few other native soluble proteins (Daniel and 
McFadden,, 19.87). . 

(60/185,987) GVGVP is a PBP made from synthetic genes. At lower temperatures the 
polymers exist as more extended molecules which, on raising the temperature above the 
transition ran ge, hydrophobically fold into dynamic structures called B-spirals that further 
1 aggre gate by hydrophobic association to form twisted filaments (Urry, 1991; Urry, et al., 1994 ). 
Inve rse temperature transition offers several advantages. Expense associated with 
chro matographic resins and equipment are eliminated. It also facilitates scale up of purification 
from grams to kilograms. Milder purification conditions use only a modest change in 
temperat ure and ionic strength. This also facilitates higher recovery, faster purification and high 
volume p rocessing. Protein purification is generally the slow step (bottleneck) in pharmaceutical 
jproduct development. Through exploitation of this reversible inverse temperature transition 
property, si mple and inexpensive extraction and purification is performed. The temperature at 
which the aggregation takes place c an be manipulated by engineering biopolymers containing 
varying num bers of repeats and changing salt concentration in solution (McPherson et al., 1996). 
A; /chloroplast mediated expression of insulin-polymer fusion protein eliminates the need for the 
expensive fermentation process as well as reagents needed for recombinant protein purification 

. ^T— ■ : " ' ' * ' 

and downstream processing. 

:i * (60/185,987) Large-scale pro duction of insulin in plants in conjunction with an oral 

delivery system is a powerful approach to provide insulin to diabetes patients at an affordable 

cost and pro vide tobacco farmers alternate uses for this hazardous crop. For example, Sun et al. 

( ' (1994) showed that feeding a small dose of antigen s conjugated to the receptor binding non-toxic 

B-subunit moiety of the cholera toxin (CTB) suppressed systemic T cell-mediated inflammatory 

reactions in animals. Oral administration of a myelin antigen conjugated to CTB has been 

shown to protect animals against encephalomyelitis, even when given after disease induction 

(Sun et al. 1996). Bergerot et al. (1997) reported that feeding small amounts of human insulin 

conjugated to CTB suppressed beta cell destruction and clinical diabetes in adult non-obese 

diabetic (NOD) mice. The protective effect could be transferred by T cells from CTB-insulin 

treated animals and was associated with reduced insulitis. These results demonstrate that 

. . • — — — -- - • ■■ ! " f 

' protection against autoimmune diabetes can indeed be achieved by feeding small amounts of 

pancreas islet cell auto antigen linked to CTB (Bergerot, et al. 1997). Conjugation with CTB 
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facilitates antigen delivery and presentation to the Gut Associated Lymphoi d Tissues (GALT) 

— ; ! — — — • ■ ■ ■ ■ ^ ■-. ■J-.'-*'. 

due to its affinity for th e cell surface receptor GM-ganglioside located oh GALT cells, for 
increased uptake and immunologic recognition (Arakawa et al. 1998). Transgenic potato tubers 
expressed up to 0.1% CTB -insulin fusion protein of total soluble protein, which retained GM- 
gang lioside binding affinity and native autogenic^ 'for both CTB an : d insulin. NOD mice fed 
with' transgenic potato tubers containing microgram quantities of CTB-insulin fusion protein 
showed a substantial reduction in insulitis and a delay in the progression of diabetes (Arkawa et 
al., 1998). Howe ver, for commercial exploitation, the levels of expression need to be increased 
in transgenic plants. Therefore, we undertook the expression of CTB-insul in fusion in transgenic 
chloroplasts of nicotine free edible tobacco to increase levels of expression adequate for animal 
testing. 

~" T ~ (60/185,987) In acc ordance with one advantageous feature of this invention, we use 
poly(GVGVP) as a fusion protein to enable hyper-expression of insulin, and accomplish rapid 
one step purification of fusion peptides utilizing the inverse' temperature transition properties of 
this polymer. In another advantageous feature of this invention, we develop insulin-CTB fusion 
protein for oral delivery in nicotine free edible tobacco (LAMP 605). Both features are 

accomplished as follows: 
• a) ' Develop recombinant DNA vectors for enhanced expression of Proinsu lin as fusion 
" proteins with GVGVP or CTB via chloroplast genomes of tobacco, 

bj : ' Obta in transgenic tobacco (Petit Havana & LAMP 605) plants, 

'c) 4 Characterize transg enic expression of proinsulin polymer or CTB fusion proteins using 
molec ular and biochemical methods in chloroplasts, 
: /d7 Employ existing or modified methods of polymer purification from transgenic leaves, 

e) Analyze Mendelian or maternal inheritance of transgenic plants, 

f) Large sca le purification of insulin and comparison of current insulin purification methods 
with pol ymer-based purification method in E. coll and tobacco, 

g) Compare natural refolding chl oroplasts with in vitro processing, 

h) Characterization (yield and purity) of proinsulin produced in E. coli and transgenic 

tobacco, and 

j) Asses sment of diabetic symptoms in mice fed with edible tobacco expre ssing CTB- 
insulin fusion protein. 

(60/185,987) Di abetes and Insulin: Insulin lowers blood glucose (Oakly et al. 1973). This is a 
'' ffilt ofits i^mSte effect in Increasing glucose uptake in tissues. In muscle, under the action 
' of insulin, glucose is more readily taken up and either converted to glycogen and lactic acid or 

oxidized to carbon dioxide. Insulin also affects a number of important enzymes concerned with 
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cellular metabolism. It increases the activity of glucokinase, which phosphoryiates glucose, 
thereby increasing th e rate of glucose metabolism in the liver. Insulin also suppresses 
gluconeogenesis by depressing the function of liver enzymes, which operate the reverse pathway 
from proteins to glucose. Lack of insulin can restrict the transport of glucose into muscle and 
adipose tissue. This results in increases in blood glucose levels (hyperglycemia). In addition, 
the breakdo wn of natural fat to free fatty acids anci glycerol is increased and there is a rise in the 
fatty acid content in the blood. Increased catabolism of fatty acids by the liver results in greater 
prod uction of ketone bodies. They diffuse from the liver and pass to the muscles for further 
oxidation. Soon, ketone body production rate exceeds oxidation rate and ketosis results. Fewer 
amino aci ds are taken up by the tissues and protein degradation results. At the same time , 
gl uconeogenesis is stimulated and protein is used to produce glucose. Obviously, lack of insulin 
has serious consequences. 

(60/185,987) Diabetes is classified into types I and II. Type I is. also known as insulin 
d ependent diabetes mellitus (IDDM). Usually this is caused by a cell-mediated autoimmune 
destruction of the pancreatic P-cells (Davidson, 1998). Those suffering from this type are 
dependent on external sources of insulin. Type II is known as noninsulin-dependent diabetes 

y ■ .■!■■■ ! • ; T-7-. -tt— 7^-^ ! ' ' ' ' j j j 

mellitus (NIDD M). This usually involved resistance to insulin in combination with its 
un derproduction. These prominent diseases have led to extensive research into microbial 
production of recombinant human insulin (rHI). 

(60/185,987) Expressi on of Recombinant Human Insulin in E. coli: In 1978, two thousand 
kilograms of insulin w ere used in the world each year; half of this was used in the United State s 
(Steineretal., 1978). At that time, the number of diabetics in the US were increasing 6% every 
year ( Gunby, 1978). In 1997 - 98, 10% increase in sales of diabetes care products and 19% 
increase in insulin products have been reported by Novo Nordisk (world's leading supplier of 
insulin), making it a 7^8 billion dollar industry. Annually, 160,000 Americans are killed by 
diabetes, making it the fourth leading cause of death. Many methods of production of rHI have 
. been developed. Insulin genes were first chemically synthesized for expression in Esherichia 
coli (Crea et al., 1978). These genes encoded separate insulin A and B chains. The genes were 
each expressed in £. coli as fusion proteins with the p-galactosidase (Goeddel et al., 1979). The 
first documented production of rHI using this system was reported by David Goeddel from 
Genentech (Hall, 1988). For reasons explained later, the genes were fused to the Trp synthase 
gene . This fusion protein was approved for commercial production by Eli Lilly in 1982 (Chance 
' and Frank, 1993) with a product name of Humulin. As of 1986, Humulin was produced from 
proinsulin ge nes. Proinsulin contains both insulin chains and the C-peptide that connects them. 

•• VW 11 L -™ ~ ™ - " ' ' " ' 
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Data conc erning commercial production of Humulin and other insulin products is now 
considered proprietary information and is not available to the public. 

(60/185, 987) Delivery of Human Insulin: Insulin has been delivered intravenously in the past 
several years. However, more recently, alternate methods such as nasal spray are also available . 
Oral delivery of insulin is yet another new approach (Mathiowitz et aL, 1997)., Engineered 
polym er microspheres made of biologically erodable polymers, which display strong interactions 
with gastroint estinal mucus and cellular linings, can traverse both mucosal absorptive epithelium 
and the follicle-associated epithelium, covering the lymphoid tissue of Peyer's patches. 
Polymers maintain contact with intestinal epithelium for extended periods of time and actually 
pe netrate through and between cells. Animals fed with the poly(FA: PLGA)-encapsulated 
insulin prep aration were able to regulate the glucose load better than controls, confirming that 
insulin cro ssed the intestinal barrier and was released from the microspheres in a biologically 
active form (Mathiowitz et al., 1997). 

(60/185,987) Protein Based Polymers (PBP): The synthetic gene that codes for a bioelastic 
PBP wa s designed after repeated amino acid sequences GVGVP, observed in all sequenced 
mammalian elastin proteins (Yeh et al. 1987). Elastin is one of the strongest known natural 
fibers and is present in skin, ligaments, and arterial walls. Bioelastic PBPs containing multiple 
repeats of this pentamer have remarkable elastic properties, enabling several medical and non- 
medical applications (Urry et al. 199 3, Urry 1995, Daniell 1995). GVGVP po l ymers prevent 
adhesions following surgery, aid in reconstructing ti ssues and delivering drugs to the body over 
an extended period of time. North American Science Associates, Inc. reported that GVGVP 
polymer is non-toxic in mice, non-sensitizin g and non-antigenic in guinea pigs, and non- 
' pyrogenic in rabbits (Urry et al. 199 3). Researchers have also observed that inserting sheets of 
* GVGVP at the sites of contaminated wounds in rats reduces the number of adhesions that form 
' ' as the wounds heal (Urry et al. 1993). In a similar manner, using the GVGVP to encase muscles 
that are cut during eye surgery in rabbits prevents scarring following the operation (Urry et aL 
1993, Urry 19 95). Other medical applications of bioelastic PBPs include tissue reconstruction 
(synthetic ligaments and art eries, bones), wound coverings, artificial pericardia, catheters and 
^ programmed drug delivery (Urry, 1995; Urry et al., 1993, 1996). 

(60/1 85,9 87) We have expressed the elastic PBP (GVGVP)i 2 i in E. coli (Guda et al. 
1995, Brixey et al. 1997), in the fungus Aspergillus nidulans (Herzog et al. 1997), in cultured 
' t obacco cells (Zhang et al. 1995), and in transgenic tobacco plants (Zhang et al. 1996). In 
particular, (GVGVP)i 2 i has been expressed to such high levels in E, coli that polymer inclusion 
bodies occupied up to about 90% of the cell volume. Also, inclusion bodies have been observed 
in chloroplasts of transgenic tobacco plants (see attached article, Daniell and Guda, 1997). 
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Recently, we reported stable transformation of the tobacco chloroplasts by integration and 
expression the biopolymer gene (EG121), into the Large Single Copy region (5,000 copies per 
cell) or the Inverted Repeat region (10,000 copies per cell) of the chloroplast genome (Guda et 
at, 1999). " 

(60/185,987) PBP as Fusion Proteins: Several systems are now available to simplify protein 

— „ — , — r—. -i . . . , r~. — , d >... — ^ ^-t- rr- •■ — •■ : ' 

purification including the maltose binding protein (Marina et al. 1988), glutethione S-tranferase 
(Smith and Johnson 1988), biotinylate d (Tsao et al. 1996), thioredoxin (Smith et al. 1998) and 
cellulose binding (Ong et al.* 1989) proteins. Recombinant DNA vectors for fusion with short 

•-TT- — !T — : — rrr — : 1 ., .• \. i t y ■ — — — : f^" 5 ' < ■ = • * 

peptides are now available t o effectively utilize aforementioned fusion proteins in the 

, ■ . ' m .in - . r-pr ! ' * " ! J — ! T" ~T rm - ~> ' 

purification process (Smith et al. 1998; Kim and Raines, 1993; Su et al. 1992). Recombinant 
proteins are generally purified by affinity chromatography, using ligands specific to carrier 
proteins (Nilsson et al. 1997). Wh ile these are useful techniques for laboratory scale 
purification, affinity chromatography for lar ge-scale purification is time consuming and cost 
prohibitive. Therefor e, economical and non-chromatographic techniques are highly desirable. In 
addition, a common solution to N-terminal degrad ation of small peptides is to fuse foreign^ 

* peptides to endogenous ^ colt proteins. Early in the development of this technique, (3- 
galactosidase (p -gal) was used as a fusion protein (Goldberg and Goff, 1986). A drawback of 
this method was that the p-gal protein is of relatively high molecular weight (MW 100,000). 
Therefore, the proportion of the peptide product in the total pro tein is low. Another problem 
associated with the large P-gal fusion is early termination of translation (Burnette, 1983; Hall, 
1988). This occurred when p-gal was used to produce human insulin peptides because the 
fusion was detached from the ribosome during translatio n thus yielding incomplete peptides. 
Other proteins of lower molecular weight proteins have been used as fusion proteins to increase 
the peptide production. For example, better yields were obtained with the tryptophan synthase 
(190aa) fusion proteins (Hall, 1988; Burnett, 1983). 

(60/185,987) Accordingly, one achieve ment according to this invention is to use 
pdly(G<VGVP) as a fusion' protein Wenable hyper-ekVressio n of insulin and accomplish rapid 
^one step purification of the fusion peptide. At low er temperatures the polymers exist as more 

^ extended molecules which, on raising the temperature above the transition range, 
hydrophobically fold into dynamic structures called p-spirals that further aggregate by 
hydrophobic association to form twisted filaments (Urry, 1991). Through exploitation of this 
reversible property, sfmpfeVnd inexpen sive extraction and purification is performed. The 
temperature at which aggregation take s place (TQ is manipulated by engineering biopolymers 
containing varying numbers of repeats or changing salt concentration (McPherson et al., 1996). 
Another group has recently demonstrated purification of recombinant proteins by fusion with 
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thermally respo nsive polypeptides (Meyer and Chilkoti, 1999): Polymers of different sizes have 
been synthesized and expressed in E. coli. This approach also eliminates the need for expensive 
reagents, equipment and time required for purification. 

(60/185,987) Cholera Toxin p subunit as a fusion protein: Vibrio cholerae causes diarrhea by 
colonizing the small intestine and producing enterotoxins, of which the cholera^fbxin (GT) is 
considered the main cause of toxicity. GT is a hexameric AB 5 pr otein having one 27KDa A 
subunit which ''KasYoxicAOT activity an d a non-toxic pentamer of 1 1 .6 kDa 

B subunits that are non-coval ently linked into a very stable doughnut like structure into which^ 
the toxic active (A) subunit is inserted. The A subunit of CT consists of two fragments -Al and 
A2 which are linked by a disulfide bond. The enzy matic activity of CT is located solely on the 
Al fragment (Gill, 1976). The A2 fragment of the A subunit links the Al fragment and the B 
pentamer. CT binds via specific interactions of the B subunit : pentame r with GM1 ganglioside, 
the membrane receptor, present on the intestinal epithelial cell surface of the host. The A subunit 
is then translocated into the cell where it ADP-ribosylates the Gs subunit of adenylate cyclase 
bringing about the increased levels of cyclic AMP in affected cells that is associated with the 
electrolyte and fluid loss of clinical cholera (Lebens et al."l 994). For optimal enzymatic activity, 
the Al fragment needs to be separated from the A2 fragment by proteolytic cleavage of the main 
chain and by reduction of the disulfide bond linking them (Mekalanos et al., 1979). 

(60/185,987) The Expres sion and assembly of CTB in transgenic potato tubers has been 
reported (Araka wa et al. 1997). The CTB gene including the leader peptide was fused to an 
endoplasmic reticulum retention signal (SEKDEL) at the 3' end to sequester the CTB protein 
within the lumen of the'ER. The DNA fragment encoding the 21-amino acid leader peptide of 
the CTB protein was retained to direct the newly synthesized CTB protein into the lumen of the 
ER. Immunoblot analysis indicated that the plant derived CTB protein was antigenically 
"indistinguishable from the bacterial CTB p rotein and that oligomeric CTB molecules (Mr - 50 
'kDa) were the dominant molecular species isolated from transgenic potato leaf and tuber tissues. 
Similar t o bacterial CTB, plant derived CTB dissociated into monomers (Mr ~ 15 kDa) during 
heat acid treatment. 

(60/185,987) Enzym e linked immunosorbent assay methods indicated that plant 
:i synthes ized CTB prote in bound specifically to GM1 gangliosides, the natural membrane 
receptors of Cholera Toxin. The maximu m amount of CTB protein detected in auxin induced 
transgenic potato leaf and tuber issues was approximately 0.3% of the total soluble protein. The 
oral immunization of CD-I mice with transgenic potato tissues transformed with the CTB gene 
(administered at weekly intervals for a month with a final booster feeding on day 65) has also 
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been reported. The levels of serum a nd mucosal anti-cholera toxin antibodies in mice were 
found to generate protective immunity against the cytopathic effects of CT holotoxin. 

(60/185,987) Following intrail eal injection with CT, the plant immunized mice showed 
up to a 60 % reduction in diarrheal fluid accumulation in the small intestine. Systemic and 
mucosal CTB-s pecific antibody titers were determined in both serum and feces collected from 
immunized mice by the cl ass-specific chemiluminescent ELISA method and the endpoint titers 
for the t hree antibody isotypes (IgM, IgG and IgA) were determined. 

(60/185,987) The exte nt of CT neutralization in both Vero cell and ileal loop experiments 
suggested that anti-CTB antibodies prevent CT binding to cellular GMl-gangliosides. Also, 
mice fed with 3 g of transgenic potato exhibited similar in testinal protection as mice gavaged 
with 30 g of bacterial CTB. Recombinant LTB [rLTB] (the heat labile enterotoxin produced by 
Enterotoxigenic E. coli) which is structurally, functionally and immunologically similar to CTB 
was expressed in transgenic tobacco (Arntzen et al. 1998; Haq et al., 1995). They have reported 
that the rLTB retained its antigenicity as shown by immunoprecipitation of rLTB with antibodies 
raised to rLTB from E. coli. The rLTB protein was of the right molecular weight and aggregated 
to form the pentamer as confirmed by gel permeation chromatography. 

(60/185,987) CTB has also been demonstrated to be an effective carrier molecule for 
induction of mucosal imm unity to polypeptides to which it is chemically or genetically 
conjugated (Mc Kenzie et al, 1984; Dertzbaugh et al, 1993). The production of 
immunomodulatory transmucosal carrier molecules, such as CTB, in plants may greatly improve 
the efficacy of edible plant vaccines (Haq et al, 1995; Thanavala et al, 1995; Mason et al, 1996) 
and may also provide novel oral tolerance agents for prevention of such autoimmune diseases as 
Type 1 diabetes (Zhang et al, 1991), Rheumatoid arthritis (Trentham et al, 1993), multiple 
sclerosis (Khoury et al, 1990; Miller et al, 1992; Weiner et al, 1993) as well as the prevention of 
allergic and allograft rejection r eactions (Savegh et al, 1992; Hancock et al, 1993). 

(60/263,668) CTB, when ad ministered orally (Lebens and Holmgren, 1994), is a potent 
'mucosal immunogen, which can neutralize the toxicity of the CT holotoxin by preventing it from 
binding to the intestinal cells (Mor et al . 1998). This is believed to be a result of binding to 
eukaryo tic cell surfaces via the G M i gangliosides, receptors present on the intestinal epithelial 
surface, thus eliciting a mucosal immune response to p athogens (Lipscombe et al. 1991) and 
enhancing the immune respons e when chemically coupled to other antigens (Dertzbaugh and 
■ Eispn, 1993; Hol mgren et al. 1993; Nashar et al. 1993; Sun et al. 1994). 

Therefore, expressing a CTB-proinsulin fusio n is an ideal approach for oral delivery of 

insulin. 
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(60/185,987) Chloroplast G enetic Engineering: Several environmental problems related to 
plant genetic engineering now prohibit advancement of this technology and prevent realization of 
its full potential. One such common concern is the demonstrated escape of foreign genes 
through pollen dispersal from transgenic crop plants to their weedy relatives creating super 
weeds or causing gene pollution among other crops or toxicity of transgenic pollen to non-target 
insects such as butterflies. The high rates of gene flow from crops to wild relatives (as high as 
38% in sunflower and 50% in strawberries) are certainly a serious concern. Clearly, maternal 
inheritance (lack of chloroplast DNA in pollen) of the herbicide resistance gene via chloroplast 
genetic engineering has been shown to be a practical solution to these problems (Daniell et al, 
1998). An other common concern is the sub-optimal production of Bacillus thuringiensis (B.t.) 
insecticidal protein or reliance on a single (or similar) B.t. protein in commercial transgenic 
c rops resulting in B.t. resistance among target pests. Clearly, different insecticidal proteins . 
should be produced in lethal quantities to decrease the development of resistance. Such hyper- 
expressi on of a novel B.t. protein in chloroplasts has resulted in 100% mortality of insects that 
. are up to 40,000- fold resistant to other B.t. proteins (Kota et al. 1999). Therefore, chloroplast 
genome is an attractive target for expression of foreign genes due to its ability to express 
\ extraordinaril y high levels of foreign proteins and efficient containment of foreign genes through 
maternal inheritance. 

(60/185,987) W hen we developed the concept of chloroplast genetic engineering (Daniell 
and McFadde n, 1988 U.S" Patents; Daniell, World Patent, 1999). It was possible to introduce 
isolated intact chlor oplasts into protoplasts and regenerate transgenic plants (Carlson, 1973). 
Therefore, early investigations on chloroplast transformation focused on the development of in 
organello systems using intact chloroplasts capable of efficient and prolonged transcription and 
^translation (Daniell and Rebeiz, 1982; Daniell et al., 1983, 1986) and expression of foreign genes 
in isolated chloroplasts (Daniell and McFaddjen, 1987). However, after the discovery of the gene 
gun as a transformation device (D aniell, 1993), it was possible to transform plant chloroplasts 
' without the use of isolated' plastids and protoplasts. Chloroplast genetic engineering was 
accomplished in several phases. Transient expression of foreign genes in plastids of dicots 
(Daniell et al., 1990; Ye et al., 1990) was followed by such studies in monocots (Daniell et al., 
1991). Unique to the c hloroplast genetic engineering is the development of a foreign gene 
expression system using autonomously replicating chloroplast expression vectors (Daniell et al., 
1990). St able integration of a selectable marker gene into the tobacco chloroplast genome (Svab 
and Maliga, 1993) was also accomplished using the gene gun. However, useful genes conferring 
valuable traits via chloroplast genetic engineering have been demonstrated only recently. For 
example, plants resistant to B. t. sensitive insects were obtained by integrating the crylAc gene 
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into the tobacco chloroplast genome (McBride et al., 1995). Plants resistant to B.t. resistant 
insects (up to 40,000 fold) were obtained by hyper-expression of the cryilA gene within the 
tobacco chlorop last genome (Kota et al., 1999). Plants have also been genetically engineered via 
the chloroplast gen ome to confer herbicide resistance and the introduced foreign genes were 
maternally inherited, overcoming the p roblem .of cut-cross with weeds (Daniell et al., 1998). 
Chloroplast genetic enginee ring has also been used to produce pharmaceutical products that are 
not used by plants (Guda et al., 2000). ; Ch loroplast genetic engineering technology is currently 
being appl ied to other useful crops (Sidorov et al. 1999; Daniell. 1999). 

(60/263,668) Most transformation techniques co-introduce a gene that confers antibiotic 

. resistance, along with the gene of interest to impart a desired trait. Regenerating transformed 

cells in antibiotic containing growth media permits selection of only those cells that have 
incorporated the foreign genes. Once transgenic plants are regenerated, antibiotic resistance 
g enes serve no useful purpose but they continue to produce their gene products. One among the ^ 
primary concerns of genetic ally modified (GM) crops is the presence of clinically important 
antibiotic resistance gene products in transgenic plants that could inactivate oral doses of the 
antibiotic (reviewed by Puchta 2000; Daniell 1999A). Alternatively, the antibiotic resistant 
genes could be transferred to pathogenic microbes in the gastrointestinal tract or soil rendering 
them resistant to treatment with such antibiotics. Antibiotic resistant bacteria are one of the 
major challenges of modern medicine. In Germany, GM crops containing antibiotic resistant 
genes have bee n banned from release (Peerenboom 2000). 

" ' (60/263,668) Chl oroplast genetic engineering offers several advantages over nuclear 
transformation including high levels of gene expression and gene containme nt but utilizes 
thousands of c opies of the most commonly used antibiotic resistance genes. Engineering 
genetically modified (GM) crops without the use of antibiotic resistance^genes should eliminate 
potential risk of their transfer to the environment or, gut microbes. Therefore, betaine aldehyde 
dehydrogenase (BADH) gene from spinach is used herein as a selectable marker (Daniell et al.. 
2000). The selection process i nvolves conversion of toxic betaine aldehyde (BA) by the 
chloroplast B ADH enzyme to nontoxic glycine betaine, which also serves as an osmo protectant. 
Chloroplast transformation efficiency was 25 fold higher in BA selection than sp ectinomycin, in 
addition to rapid r egeneration (Tgblej). Transgenic shoots appeared within 12 days in 80% of 
leaf discs (up to 23 shoots per di sc) in BA selection compared to 45 days in 15% of discs (1 or 2 
" shoots per disc) on spectinomycin selection as shown in Fig. 11. Southern blots confirm stable ^ 
integration of foreign genes into all of the chloroplast genomes (~ 10,000 copies per cell) 
resulting in homoplasmy. Tr ansgenic tobacco plants showed 1527 - 1816% higher BADH 
activity at different develo pmental stages than untransformed controls. Transge nic plants were 
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morpho-Iogically in distinguishable from untransformed plants and the introduced trait was stably 
inherited in the subsequent generation. This is the first report of genetic engineering of the 
chloroplast genome without the use of antibiotic selection. Use of genes that are naturally 
present in spinach for selection, in addition to gene containment, should ease public concerns or 

; perception of GM crops. Also, this should be very helpful in the development of edible insulin. 
(60/185,987) Polymer-proinsulin Recombinant DNA Vectors: First we developed 
independent chloropl ast vectors for the expression of insulin chains A and B as polym er fusion 
peptides, as it has been produced in E. coli for commercial purposes in the past. The 
disadvantage of this method is that E. coli does not form disulfide bridges in the cell unless the 
protein is t argeted to the periplasm. Expensive in vitro assembly after purification is necessary 
for this approach. Therefore, a better approach is to express the human proinsulin as a polymer 
fusion protein. This method is better because chloroplasts are capable of forming disulfide 

' tridgesl Using a single gene, as "opposed to the individual chains, eliminates the necessity of 
conducting two parallel vector construction processes, as is needed for individual chains. In 
addition, the need for individual fermentati ons and purification procedures is eliminated by the 
single gene method. Further, proinsulin products require less processing following extraction. 
Anothe r benefit of using the proinsulin is that the C-peptide, which is an essential part the 
proinsulin pro tein, has recently been shown to play a positive role in diabetic patients (Ido et al, 
1997). 

(60/185,9 87) Recently, the human pre-proinsulin gene was obtained from Genentech, 
'■" Me: First, the pre-proinsu lin was sub-cloned into pUC19 to facilitate further manipulations. The 
next step was to design primers to make chloroplast expression vectors. Since we are interested 
in proinsulin expre ssion, the 5' primer was designed to land on the proinsulin sequence. This 
FW primer eluded the 69 bases or 23 coded a mino acids of the leader or pre-sequence of > 
preproinsulin. Also, the forward primer in cluded the enzymatic cleavage site for the protease 
factor Xa to avoid the use of cyanogen bromide. Beside t he Xa- factor, a Smal site was 
introduced to facilitate: subsequent subcloning. The order of the FW primer sequence is Smal - 
Xa-factor - Proinsulin gene. The reverse primer includes BamHl and Xbal sites, plus a short 
sequence with h omolgy with the pUC19 sequence following the proinsulin gene. The 297bp 
PCR product (Xa Pris) includ es three restriction sites, which are the Smal site at the 5'-end and 
Xbal/BamHl sites at the 3' end of the proinsuli n gene. The Xa-Pris was cloned into pCR2.1 
resulting in pCR2.1 - Xa- Pris (4.2kb). Insert ion of Xa-Pris into the multiple cloning site of 
pCR2.1, resulted in additional flanking restriction enzyme sit es that will be used in subsequent 
sub-cloning steps. A GVGVP 50-mer was generated as described previously (Daniell et al. 
1997). The ribosome binding sequence was introduced by digesting pUCs-10, which contains 
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the RBS sequence GAAGGAG, with Nool and Hind III flanking sites. The plasmid pUC 19-50 
was also digested with the same enzymes. The 50mer gene was eluted from the gel and ligated 
to pUCs-10 to produce pUCs-10-5 0mer. The ligation step inserted into the 50mer gene a RBS 
sequence and a Smal site outside the gene to facilitate subsequent fusion to proinsulin. 

(60/185,987) Another Smal partial digestion was performed to eliminate the stop codon 
of the biopolymer, transform the SOmer to a 40mer, and fuse the 40mer to the Xa-proinsulin 
sequence. The conditions for this partial digestion needed a decrease in DNA concentration and 
the 1:15 dilution of Sm al. Once the correct fragment was obtained by the partial digestion of 
Smal (eliminating the st op codon but include the RBS site), it was ligated to the Xa-proinsulin 
fusion gene resulting in the construct P CR2.1-40-XaPris. Finally, the biopolymer (40mer) - 
proinsulin fusion gene was subcloned into pSBL-CtV2 (chloroplast vector) by digesting both 
vectors with Xbal. Then the fusion gene was ligated to the pSBL-CtV2 and the final vector was 
called pSBL-OC- XaPris. The orientation of the insert was checked with Nool: one the five t 
colonies chosen had the correct orientation of th e gene. The fusion gene was also subcloned into 
pLD-CtV vector and the orie ntation was checked with EooRl and Pvuil. One of the four 
colonies had the correct orientation of the insert. This vector was called pLD-OC-XaPris 
(Fig.2A). 

(60/185,987) Both chloroplast vectors contain the 16S rRNA promoter (Prm) driving the 
selectable marker gene aadA (aminoglycoside adenyl transf erase conferring resistance to 
spectinomycin) fo llowed by the psbA 3' region (the terminator from a gene coding for 
photosystem II reaction center components) from the tobacco chloroplast genome. The only 
difference between these t wo chloroplast vectors (pSBL and pLD) is the origin of DNA 
fragments. Both pSBL and pLD are universal chloroplast expression/integration vectors and can 
be used to transform chloroplast genomes of several other plant species (Daniell et al. 1998) 
because these flanking sequences are highly con served among higher plants. The universal 
vector uses trnA and trnl genes (chloroplast transfer RNAs coding for Alanine and Isoleucine) 
from the inverted repeat region of the tobacco chloroplast gen ome as flanking sequences for 
homologous recombination as shown in Figs. 2 A and 3B. Because the universal vector 
integrates foreign genes within the Inverted Repeat region o f the chloroplast genome, it should 
double the copy numb er of insulin genes (from 5000 to 10,000 copies per cell in tobacco). 
Furthermore, it has been demonstrated that homoplasmy is achieved even in the first round of 
selection in tobacco probably because of the presence of a chloroplast origin of replication within 
the flanking sequence in the universal vector (thereby providing more templates for integration). 
Because of these and several other reasons, foreign gene expression was shown to be much 
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higher when the universal vect or was used instead of the tobacco specific vector (Guda et al., 

2000). . 

(60/185,987) DN A sequence of the polymer-proinsulin fusion was determined to confirm 
t he correct orientation of genes, in frame fusion and lac k of stop codons in the recombinant DNA 
constructs. DNA sequencing was performed using a Perk in Elmer ABI prism 373 DNA 
sequencing system using a ABI Prism Dye Termination Cycle Sequencin g Kit. The kit uses 
AmpliTaq DNA polymerase. Insertion sites at both ends were sequenced usi ng primers for each 
strand. Expression of all chloroplast vectors was first tested in E. coli be fore their use in tobacco, 
transformation because of the similarity of protein synthetic machinery (Brisey et al. 1997). For 
Escherichia coli expression XL-1 Blue strain was used. E. coli was tran sformed by standard 
CaCl 2 transformation procedures. 

(60/185,987) Expression and Purification of the Biopolymer-proinsulin fusi on protein; 

Terrific broth growth medium was inoculated with 40ul of Ampicillin (100mg/ml) a nd 40ul of 
the XL-1 Blue MRF To strain of E. coli containing pSBL-OC-XaP ris plasmid. Similar, 
inoculations were made for pLD-OC-XaPris and the negati ve controls, which included both 
plasmids containing the gene in the reverse orientation and the E. coli strain without any 
olasmid. Then. 24hr cultures were centrifuged at 13,000 rpm for 3 min . The pellets were 
resuspended in 500ul of autoclaved dH ? Q and transferred to 6ml Falcon tubes. The resuspended 
pellet was sonicated, using a High Intensity Ultrasonic processor, for 15-sec at an am plitude of 
40 and then 15 sec on ice to extract the fusion protein from cells. This sonication cycle was 
repeated 15 times. The sonicated samples were transfe rred to microcentrifuge tubes and 
centrifuged at 4°C at 10,000g for 10 min to purify the fusion protein. After centrifugation, the 
supernatant were transferred to microcentrifudge tubes and an equal volume of 2XTN buffer 
(lOOmM TrisHCI, pH 8, 100 mM NaCl) was added. Tubes were warmed at 42°C for 25 min to 
foduce biopolymer aggregation. Then the fusion protein was recov ered by centrifuging at 2,500 
i;pm at 42°C for 3 min. The recovered fusion protein was r esuspended in lOOul of cold water. 
The purification process was repeated twice. Also, the fusion protein was recovered by using 
6M Guanidine hydrochloride phosphate buffer, pH 7.0 (instead of water), to facilita te stability of 
i nsulin. New cultures were incubated for this step following the same procedure as described ' 
above, except that the pSBL-OC-XaPris expressing cells were incub ated for 24, 48 and 72 hrs. 
Cultures were centrifuged at 4,000 rpm for 12 min and the pellet was resuspended in 6M 
guanidine hydrochloride phosphate buffer, pH 7.0, and then sonicated as described above. After 
sonication, samples were run in a 16.5% Tricine gel, transferred to the nitrocellulose membrane, 
a ^id immunoblotting was performed the following day. 
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(60/185,987) A 15% glycine gel was run for 6h at recomme nded voltage as shown in Fig. 
1 1 ' Two differen t methods of extraction were used. It was observed that when the sonic extract is 
in 6M Guanicine Hyd rochloride Phosphate Buffer, pH7.0, the molecular weight changes from its 
original and correct MW 24 kD to a higher MW of approximately 30 kDa (Fig. 1C. I). This is 
probably due to th e conformation that the biopolymer takes under this kind of buffer, which is 
used to maximize the extraction of proinsulin. 

(60/185,987) The gel was first stained with 0.3M CuCl 2 and then the same gel was 
stained with Commassie R-250 Staining Solution for an ho ur and then destained for 15 min first, 
arici then overnight. CuCl 2 creates a negative stain (Lee et al. 1987). Polym er proteins (without^ 
fusion) appear as clear bands against a blue back ground in color or dark against a light 
semiopaque background (Fig. 1 A). This stain was used because other protein stains such as 
Coomassie Blue R250 does not s tain the polymer protein due to the lack of aromatic side chains 
^McPherson et al., 1992). Therefore, the observation of the 24 k6a protein in R250 stained gel^ 
Tig. IB) is due to the insulin fusion with the polymer. This observation was further confirmed 
by probing these blots with the antihuman proinsulin antibody. As anticipated, the polymer^ 
i nsulin fusion protein was observed in western blots as shown in Fig. 1C, even though the 
finding of antibody was less efficient (probably due to concealment of insulin epitopes by the 
polymer). Larger proteins observed as shown in Fig. 1C II are tetramer and hexamer complexes^ 

j ftf proinsulin. . . 

(60/185,987) It is eviden t that the insulin-polyer fusion proteins are stable in E. coli. 
^Confirming this observation, recently another lab has shown that the PBP polymer protein 
conjugates (with thioredoxin and tendamistat) undergo thermally reversible phase transition, 
retaining the transition behavior of the free polymer (Meyer and Chikoti, 1999). These results, 
clearly demonstrate that insulin fusion has not affected, the inverse temperature transition 
property of the polymer. One of the concerns is the stability of insulin at temperatures used for 
^thermally reversible purification. Temperature induced production of human insulin has been in 
commercial use (Schmidt et al. 1999). Aho/the temperature transition can be lowered by 
increasing the ionic strength of the solution during purification of this PSP (McPherson et al, 
1996). Thus, GVGVP-fusion could be used to purify a multitude of economically important 
proteins i n a simple inexpensive step. 

(60/263.668) XL-1 Blue strain of E.coli containing pLD-OC-XaPris and the negative 
Controls, which included a plasmid containing the gene in the reverse orientation and the E.coli 
strain without any plasmi d were grown in TB broth. Cell pellets were resuspended in 500 pi of 
autodaved dH 2 b or 6M Guanidihe hydrochloride phosphate buffer, pH 7.0 were sonicated and 
centrifuged at 4°C at 10,000 g for lOmin. After centrifugation, the supernatants were mixed with 



23 



1465-PCT-00 (1577-P-OO) 

an equal volume of 2XTN buffer (100 mM Tris-HCl, pH 8, 100 mM NaCl). Tubes were warmed 
at 42°C for 25 min to induce biopolymer aggregation. Then the fusion protein was recovered by 
centrifuging at 2,500 rpm at 42°C for 3 min. Samples were run in a 16.5% Tricine gel, 
transferred to the nitrocellulose me mbrane, and immunoblotting was performed. When the sonic 
extract is in 6M Guanidine H ydrochloride Phosphate Buffer, pH 7.0, the molecular weight 
changes from its original and correct MW 24 kD to a higher MW of approximately 30 kDa as 
shown in Figs. 12A and B. This is probably due to the conformation of the biopolymer in this 
buffer. 

(60/263,668) The gel was first stained with 0.3M CuCl 2 and then the same gel was 
stained w ith Commassie R-250 Staining Solution for an hour and then destained for 15 min first, 
and then overnight. CuCl 2 creates a negative stain (Lee et al. 1987). Polymer proteins (without 
fusion) appear as clear bands against a blue background in color or dark against a light 
semiopaque background as shown in Fig. 12A. This stain was used because other protein stains 
such as Coomassie Blue R250 does not stain the polymer protein due to the lack 6f aromatic side 
chains (McPherson et al., 1992). Therefore, the observation of the 24 kDa protein in R250 
stained gel as shown in Fig. 12B is due to the insulin fusion with the polymer. This observation 
was further confirmed by probing these blots with the anti-human proinsulin antibody. As 
anticip ated, the polymer insulin fusion protein was observed in western blots as shown in Figs. 
13A and B. Larger proteins observed in Figs. 13A - C are tetra mer and hexamer complexes of 
proinsulin. It is evident that the insulin-polyme r fusion proteins are stable in E.coli. Confirming 
this observation, recently others have sho wn that the PBP polymer protein conjugates (with 
thioredoxin and tendamistat) un dergo thermally reversible phase transition, retaining the 
transition b ehavior of the free polymer (Meyer and Chilkoti, 1999). These results clearly 
demonstrat e that insulin fusion has not affected the inverse temperature transition property of the 
polymer. One of the concerns is the st ability of insulin at temperatures used for thermally 
reversible purification. Temperature induced production of human insulin has been in 
commercial use (Schmidt et al. 1999). Also, the temperature transition can be lowered by 
increasing the ionic strength of the solution during purification of this PBP (McPherson et al. 
1996). Thus, GVGVP-fusion could be used to puri fy a multitude of economically important^ 
proteins in a simple inexpensive step. 

(60/185,987) Biopolymer-proinsulin fusion gene expression in chloroplast: As described in 
section d, pSBL-OC-R40XaPris ve ctor and P LD-OC-R40XaPris vectors were bombarded into 
the tobacco'chloroplasts genome vi a particle bombardment (Daniell., 1997). PCR was 
performed to confirm biopolymer-proins ulin fusion gene integration into chloroplast genome.^ 
The PCR products were examined in 0.8% agarose gels. Fig. 2A shows primers landing sites . 
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and expected PCR products. Fig. 2B shows the 1.6 kbp PCR product, confirming integration of 
the aadA gene into the chloroplast genome. This 1.6kb produc t is seen in all clones except L9, 

m , I : • ' 

which is a mutant W e used primers 2P and 2M to confirm integration of both the aadA and 
biopolymer-proinsulin fusion gene. The 1.3 kbp, product corresponds to the native chloroplast 
fragment and the 3.5 kbp product corresponds to the chloroplast genome that has integrated all 
three genes a s shown in Figs, 2C amd D. All the clones examined at this time show 
heteroplasmy, exce[t c;pmes :8d om Fog/ 2C, and S41b in Fig. 2D, which show almost 
homoplasmy. 

(60/263,668) As described i n section d, chloroplast vector was bombarded into the 
tobacco chloroplast genome via particle bombardment (Daniell, 1997). PCR and Southern Blots 
were performed to confirm biopolymer-proinsulin fusion gene integration into chloroplast 
genome. Southern blots show homoplasmy in most T 0 lines but' a few showed some 
heteroplasmy as shown in Fig. 14. Western bl ots show the expression of polymer proinsulin 
fusion protein in all transgenic lines in Fig. 13C. Quantification is by ELISA. 
^(60/185,987) Protease ia Digestion of the Biopolymer-proinsuli n fusion protein and 
Purification of Proinsulin: Factor Xa was purchased from New England Biolabs at a 
concentration of 1.0 mg/ml. The Factor Xa is supplied in 20mM HEPES, 500mM, N aCl, 2mM 
CaCl 2 , 50% glycerol, (pH 8.0). The reaction was carried out in a 1:1 ratio of fusion protein to 
reaction buffer. The reaction buffer was made with 20mM Tris-HCI, lOOmM NaCl, 2mM 
CaCl 2 , (pH 8.0). The e nzymatic cleavage of the fusion protein to release the proinsulin protein ^ 
from the (GVVP) 40 was initiated by adding the protease to the purified fusion protein at a ratio 
(ww) of approximately 1,500. This digestion was continued for 5 days with mild stirring at 4°C. 
Cleavage of the fusion protein was monitored by SDS-PAGE analysis. After the cleavage, the 
same conditions are used for purification of the proinsulin protein. The purification steps are the 
same as for the purification of the fusion protein, except that instead of recovering the pellet, the 
'supernatant is saved. We detected cleaved proinsulin in the extra cts isolated in 6M guanidine 
hydrochloride buffer as shown in Fig. 1C 11. Conditions can be estimized for complete 
cleavage. The Xa protease has been successfully used to cleave (GVGVP)2o-GST fusion 
(McPherson et al. 1992). Therefore, cleavage of proinsulin from GVGVP using the Xa protease 
does not pose problems. 

(60/263,668) The enzymatic cleavage of the fusion protein to release the proinsulin 
protein from the (GVGYP) 40 was initiated by adding the factor 10A protease to the purified 
fusion protein at a ratio (w/w) of approximately 1:500. Cleavage of the fusion protein was 
monitored by SDS-PAGE analysis. We detected cleaved proinsulin in the extracts isolated in 
6M guanidine hydrochloride buffer as shown in Figs. 13 A and B. Conditions are noweing 
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optimized for complete cleavage. The Xa protease has been successfully used previously to 
cleave (GVGVP) 2 o-GST fusion (McPherson et al. 1992). 

(60/263,668) Evaluation of chloroplast gene expr ession: (1577-P-00) A systematic approach^ 
' to identify and overcome potential limitatio ns of foreign gene expression in chloroplasts of 
transgenic plants is essential.' Information gained herein increases the utility of chloroplast 
transformation system by scientists intere sted in expressing other foreign proteins. Therefore, it^ 
is important to systematically analyze transcript ion, RNA abundance, RNA stability, rate of^ 
protein synthesis and degradation, proper folding and biologica l activity. For example, the rate of 
transcription of the introduced insulin gene may be compared with the highly expressing 
endogenous chloroplast genes (rbcL, psbA, 16S rRNA), using run on transcription assays to 
determine if the 16SrRNA promoter is operating as expected. Transgenic chloroplast containing 
each of the three constructs with different 5' regions is i nvestigated to test their transcription 
efficiency. Similarly, Vransgene RNA levels is monitored by northerns, dot blots and primer 
extension relative to end ogenous rbcL, 16S rRNA, or psbA. These results along with run on 
transcription assays should provide valuable informatio n of RNA stability, processing, etc. With 
our past experience in expression of several foreign genes, foreign transcripts appear to be 
extremely stable based on northern blot analysis. However , a systematic study is valuable to 
advance utility of this system by other scientists. 

(60/263,668) Importantly, the efficiency of translation may be tested in isolated 
chloroplasts and compared with the highly translated chloroplast protein (psbA). Pulse chase 
experiments help assess if translational pausing, premature termination occurs. Evaluation of 
percent RNA loaded on polysomes or in constructs with or without 5 ! UTRs helps determine the 
'efficiency of the ribosome binding site and 5' stem-loop translational enhancers. Codon 
optimized genes are also compared with unmodified genes to investigate the rate of translation, 

• ■■ i i . - ■ ■ ■< \ 1— ^ ~, — *— ■ -i r-WB ! ' n-T — : ■ r*~- T"^ T" 1 ! — ; ..... 

pausing and termination. In our recent experience, we observed a 200-fold difference in 
accumulation of foreign proteins due to decreases in pr oteolysis conferred by a putative 
chaperonin (De Cosa et al. 2001). Therefore, proteins from constructs expressing or not 
* expressing the putative chaperonin (with or without ORF1+ 2) provide valuable information on 
protein stability. Thus, all of this information may be used to improve the next generation of 
chloroplast vectors. 

' (60/185,987) Vector for CTB expression in chloroplasts: The leader sequence (63 bp) of the 
native CTB gene (372 bp) was deleted and a start codon (ATG) introduced at the 5' end of the 
remaining CTB gene (309 bp). Primers were designed to introduce a rbs site 5 bases upstream of 
the start codon. The 5' primer (38mer) was designed to and on the st art codon and the 5-end of 
the CTB gene. This primer had an Xbal site at the 5'-end, the rbs site [GGAGG], a 5 bp 
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breathing space followed by the first 20 bp of the CTB gene. The 3' primer (32mer) was 
designed to land on the 3' end of the CTB gene and it introduced restriction sites at the 3' end to 
facilitate subcloning. The 347 bp rCTB PCR product was subcloned into pCR2.1 resulting in 
pcC R2.1-rCTB. The final step was insertion of rCTB into the Xbal site of the universal or 
toba cco vector (pLB-CtV2) that allows the expression of the construct in E. coli and 
chioroplasts. Restriction enzyme digestion of the pLD-LH-rCTB vector with BamHl was 
performed to confirm the correct orientation of the inserted fragment in the vector. 

(60/185,987) Beca use of the similarity of protein synthetic machinery, expression of the 
chloroplast vector was tested in E. coli before its use in tobacco transformation. For Escherichia 
coli expression the Xt-1 Blue MRFtq strain was used. E. coli was transformed by standard 
CaCl 2 transformation procedures. 'Transformed E. coli (24 hrs culture and 48 hrs culture in 
100ml TB with l OOmg/ml ampicillin) and untransformed E. coli (24 hrs culture and 48 hrs 
culture in 100ml TB with 12.5mg/ml tetracycline) was then centrifuged at 10000 x g in a 
Beckman GS-15R centrifuge for 15 min. The pellet was washed with 200mM Tris-Cl twice and 
resuspended in 500ul extraction buffer (200mM Tris-Cl, pH8.0, lOOmMNaCl; lOmM EDTA, 
2mM PMSF) and then sonicated using the Autotune Series High Intensity Ultrasonic Processor. 
Then, lOOul aliquots of the sonicated transformed and untransformed cells [containing 50 - 
lOOug of crude protein extract as determined by Bradford protein assay (Bio-Rad Inc)] and 
purified CTB (Sigma C-9903) were boiled with 2X SDS sample buffer and separated on a 15% 
SDS-PAGE gel in Tris-glycine buffer (25mM Tris, 250 mM glycine, pH8.3, 0.1% SDS). The 
separated protein was then transferred to a nitrocellulose membrane by electro blotting using the 
Trans-Blot Electrophoretic Transfer Cell (Bio-Rad Inc.). 

(60/185,987) Immunoblot detection of CTB expression in E. coli: Nonspecific antibody 
reactions were blocked by incubation of the membrane.in 25ml of 5% non-fat dry milk in TBS^ 
buffer for 1 - 3 hrs on a rotary shaker (40 rpm), followed by washing in TB S buffer for 5 min. 
The membrane was then incubated for an hour with gentle agitation in 30 ml of a 1:5000 dilution 
"of rabbit anti-cholera antiseru m (Sigma C-3062) in TBS with Tween-20 [TBST] (containing 1%, 
non-fat dry milk) followed by washing 3 times in TBST buffer. The membrane was incubated 
for an hour at room temperature with gentle agitation in 30 ml of a 1: 10000 dilution of mouse 
anti-rabbit lgG conjugated with alkaline phosphatase in TBST. It was then washed thrice with 
TBST and once with TBS followed by incubation in the Alkaline Phosphatase Color 
Development Reagents, BCIP/NBT in AP color development Buffer (Bio-Rad, Inc.) for an hour . 
Immunoblot analysis snows the presence of 1 1.5 kDa polypeptide for purified bacterial CTB and 
transforme d 24h/ 48h cultures (Fig. 3A, lanes 2, 3 and 5). The 48h culture appears to express 
more CTB than that of the 24h culture indicating the accumulation of the CTB protein over time. 
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The purified bacterial CTB (45 Kda) dissociated into monomers (1 L5 KDa each) due to boiling 
prior to SDS PAGE. T hese results indicate that the pLD-LH-CTB vector is expressed in E. coli. 
Because of the similarity of the E. coli protein synthetic machinery to that of chloroplasts, 
chloroplast expression of the above vector should be possible. 

(60/185,987) CTB expression in chloroplasts; As described below, pLD-LH-CTB was 
integrated into the tobacco chloroplast genome via particle bombardment (Daniell, 1 997). PCR 
analysis was performed to confirm chloroplast inte gration. Fig. 3B shows primer landing sites 
and size of expected products. PCR analysis of clones obtained after the first round of selection 
was carried o ut as described' below. PCR products were examined on 0.8% agarose gels. The 
PCR results (Fig. 3 C) show that clones 1 and 5 that do not show any product are mutants while 
clones 2, 3, 4, 6, 7, 8, 9, 10 and 1 1 that gave a 1.65 kbp product are transgenic. As expected, 
lanes 13 - 15 did not give any PCR product, confirming that the PCR reaction was not 
contaminated. Because primers 3P & 3M land on the aadA gene and on the chloroplast genome, 
all clones that show PCR products have integrated the CTB gene and the selectable marker into 
the chloroplast genome. Clones that showed chloroplast integration of the CTB gene were 
. moved to the sec ondVround of selection to increase copy number. ' PCR' analysis of clones 
obtained after the second round of selection was also carried out? PCR results shown in Fig. 3D 
indicate that clone 5 does not give a 3 kbp product indicating that it is a mutant as observed 
earlier. Other clones give a strong 3 kbp product and a faint 1.3 kbp (similar to the 1.3 kbp 
untransformed plant product) product, indicating that they are transgenic but not yet 
homoplasmic. Complete homoplasmy can be accomplished by several more rounds of selection 
or by germinating seeds from transgenic plants on 500 jig/ml of spectinomycin. 
Vector constructions: (60/263,668) pLD vector is used for all the constructs. This vector was 
developed for chloroplast transformation. It contains the 16S rRNA promoter (Prrn) driving the 
selectable marker gene aadA (aminoglycoside adenyl transferase conferring resistance to 
s pectinomycin^ followed by the multiple cloning site and then Xho psbA 3' region (the terminator^ 
from a gene coding for photosystem II reaction center components) from the tobacco chloroplast 
genome. The pLD vector is a universal chloroplast expression /integration vector and can be 
used to transform chloroplast genomes of several other plant species (Daniell et at. 1998, Daniel^ 
: 1999) because these flanking sequences are highly conserved among higher plants. The 
universal vector uses trnA and fmf genes (chloroplast transfer RNAs coding for Alanine and y 
Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as flanking 
* sequences' for Homologous the universal vector integrates foreign genes^ 

' within the Inverted Repeat region of the chloroplast ge nome, it should double the copy number ^ 
" '""of the transgene (from 5000 to 10,000 6opies per cell in tobacco). Furthermore, it has been 



28 



1465-PCT-OO (1577-P-00) 

demonstrat ed that homoplasmy is achieved even in the first round of selection in tobacco 
'probab ly because of the presence of a chloroplast origin of replication within the flanking 
' sequence in the universal vector (thereby providing more templates for integration). These, and 
■several other reasons, foreign gene expression was shown to be much higher when the universal 
vector was used instead of the tobacco specific vector (Guda et al. 2000). 
(^0/185,987) CTB-Proinsulin Vector Construction; The chloroplast expression vector pLD- 
' CTB-P roins was constructed as follows. First, both proinsulin and cholera toxin B-subunit genes 
were amplified from suitable DNA using primer sequences. Primer 1 contains the GGAGG 
' chloroplast preferred ribosome binding site five nucleotides upstream of the start codon (ATG) 
for the CTB gene and a suitable restriction enzyme site (Spel) for insertion into the chloroplast 
vector. Primer 2 eliminates the stop codon and adds the first two amino acids of a flexible hinge 
tetrapeptide GPGP as reported by Bergerot et al. (1997), in order to facilitate folding of the CTB- 
proinsulin fusion protein. Primer 3 adds the remaining two amino acids for the hinge tetra- 
peptide and eliminates the pre-sequence of the pre-proinsulin. Primer 4 adds a suitable 
restrictio n site (Spel) forsubcloning into the chloroplast vector. Amplified PCR products were 
1 inserted into the f A clonipg vector. Both the CTB and proinsulin PCR fragments were excised 
at the Smal and Xbal restriction sites. Eluted •fragments were ligated into the TA cloning vector. 
Interestingly, all white colonies showed the wrong orientation for CTB insert while three of the 
Vive blue colonies examined showed the right orientation of the CTB insert. The CTB-proinlulin 
fragment was excised at the EcoRl sites and inserted into EcoRl digested dephosphorolated pLD 
vector. Resul tant onicroplast integration expression vector,' pLD-CTB-Proins will be tested for 
expression in E. coli by western blots. After confirmation of expression of CTB-proinsulin 
fusion in E. coli, pLD-CTB-Proins will be bombarded into tobacco cells as described below. 

(60/263,668) The follo wing vectors may be designed to optimize protein expression, 
purifica tion and production of proteins with the same amino acid composition as in human 
insulin. 

a) Using tobacco plants, Eibl (1999) demonstrated, in vivo, the differences in translation^ 
' efficiency and mRNA stability of a GUS reporter gene due to various 5' and 3'^ 
un translated regions (UTR's). This already described systematic transcription and 
tran slation analysis can be used in a practical endeavor of insulin production. Consistent 
. 3 V .i with Eibl's (1999) data for increased translatio n efficiency and mRNA stability, the psbA 
5' uf R can b'e used in addition with the psbA 3' UTR already in use. The 200 bp 
tobacco chloroplast DNA fragment containing 5' psbA UTR may be amplified by PCR 
using tobacco chloroplast DNA as template. This fragment may be cloned directly in the 
pLD vector multiple cloning site downstream of the promoter and the aadA gene. The 
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cloned sequence may be exactly the same as in the psbA gene. (Update "Human 
Insulin") We have cloned the 5' untranslated region of the tobacco psbA gene including 
the promoter (S'UTR). shown in Figure 32. We performed PCR using the primers 
CCGTCGACGTAGAGAAGTCCGTATT (SEP ID NG:4) and GCCCATGGTAAAATC 
TTGGTTTATTTA (SEP ID NO:5V which resulted in a 200 base pair product, as 
expected. We inserted this PCR product into a TA cloning vector. Since restriction 
enzyme sites were not available to subclone the 5'UTR immediately upstream of the gene 
coding for the CTB-proinsulin fusion protein, we used the "SPEing" PCR technique to 
create the DNA sequence with the 5'UTR immediately upstream of the CTB-proinsulin 
gene (Figure 33). The products of this PCR include both the 5'UTR (200bp) and the 
gene for CTB-proinsulin (600bp) as additional products as well as the desired 5'UTR 
CTB-proinsulin (5CP) at 800 bp. 5 CP was eluted and then inserted into the TA cloning 
vector where DNA sequencing was performed to confirm accuracy of nucleotide 
sequence before it was subcloned into the pLD vector, 
b) Another approach of protein production in chloroplasts involves potential insulin 
crystallization for facilitating purification. The crv2Aa2 Bacillus thurinsiensis operon 
derived putative chaperonin may be used. Expression of the crv2Aa2 operon in 
chloroplasts provides a model system for hyper-expression of foreign proteins (46% of 
total soluble protein) in a folded configuration enhancing their stability and facilitating 
purification (De Cosa et al. 2001). This justifies inclusion of the putative chaperonin 
from the crv2Aa2 operon in one of the newly designed constructs. In this region there are 
two open reading frames (PRF1 and PRF2) and a ribosomal binding site (rbsY This 
sequence contains elements necessary for Crv2Aa2 crystallization, which help to 
crystallize insulin and aid in subsequent purification. Successful crystallization of other 
proteins using this putative chaperonin has been demonstrated (Ge et al. 1998V The 
PRF1 and PRF2 of the Bt Crv2Aa2 operon may be amplified by PCR using the complete 
operon as a template. Subsequent cloning, using a novel PCR technique, allows for 
direct fusion of this sequence immediately upstream of the proinsulin fusion protein 
without altering the nucleotide sequence, which is normally necessary to provide a 
restriction enzyme site (Horton et al. 1988). 

(Update "Human Insulin") Another parameter of foreign protein production to be 
investigated is post-translational. The DNA for the putative chaperonin in the Bacillus 
thurinziensis Cry 2A2 operon encodes a protein that could potentially fold and crystallize 
CTB-Proinsulin, which would allow it to accumulate in large quantities protected from 
chloroplast proteases and facilitate in subsequent purification. Standard molecular 
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biology techniques were used to insert this DNA fragment immediately upstream of the 
5'UTR of the construct containing the chloroplast optimized proinsulin. Additionally, 
another vector was constructed to contain only Shine-Dalgarno sequence (GGAGG) 
followed by the sequence encoding for the Cholera toxin B subunit and synthetic 
chloroplast optimized proinsulin fusion (CTB-PTpris). This construct will allow us to 
determine the value of the proinsuli n sequence modification both with and without the 
5'UTR. „, , . . , . . . 

c) To address codon optimization the proinsulin gene may be subjected to certain 
modifications in subsequent constructs. The plastid modified proinsulin (PtPris) can 
have its nucleotide sequence modified such that the codons are optimized for plastid 
express ion, yet its amino acid sequence remains identical to human proinsulin. PtPris is 
an ide al substitute for human proinsulin in the CTB fusion peptide. The expression of 
this construct can be compared to the native human proinsulin to determine the affects to 
codon optimization, which serve to address one relevant mechanistic parameter of 
translation. Anal ysis of human proinsulin gene showed that 48 of its 87 codons were the 
lowest frequency codons in the chloroplast for the amino acid for which they encode . 
For example, there are six d ifferent codons for leucine. Their frequency within the 
chloroplast genome ranges from 7.3 to 30.8 per thousand codons. There are 12 leucines 
in proinsulin, 8 have the lowest frequency codons (7.3), and none code for the highest 
frequency co dons (30.8). In the plastid, optimized proinsulin gene all the codons code 
for the most frequent, whereas in human proinsulin over half of the codons are the least 
frequent. Human proinsulin nucleotide sequence contains 62% C+G, whereas plastid 
optimize d proinsulin gene contain 24% C+G. Generally, lower C+G content of foreign 
genes correlates wjth higher levels of expression (Table 2). 

(Update "Human Insulin") Chloroplast foreign gene expression correlates well with %AT 
of the gene coding sequence. Th e native human proinsulin sequence is 38% AT, while 
the newly synthesized chloroplast optim ized proinsulin is 64% AT. We determined the 
optimal chloroplast coding sequence for the proinsulin (PTpris) gene by using a codon 
composition that is equivalent to the highest translated chloroplast gene, psbA. The 
prefered codon composition of psbA i n tobacco is conserved within 20 vascular plant 
species. We have compared it to the native human proi nsulin DNA sequence (Figure 34). 
Since there are too many changes for conventional mutagenesis, we employed the 
Recursive PCR method for total flene synthesis. Figure 35 shows the product of this gene 
synthesis corresponding to the 280 bp expected size. 
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This product, PTpris, was then used as a template with CTB and 5'UTR to create a fusion 
of these sequences using the SOEing PCR technique. The prodiicts of this reaction can 
be seen in figure 36. These Jnclude'yUTO(2dO bp^ CTB (320 bp), Proinsulin (280 bp), 
and CTB-Proinsulin (600 bp) as side products, and also the desired 5'UTR CTB-PTpris 
(5CPTP) at 800 bp: This was then inse rted into the TA cloning vector where the sequence 
was verified before being subcloned into the pLD vector. 



d) Another version of the proinsulin gene, mini-proinsulin (Mpris), may also have its 
^codons optimized for plastid expression, and its, amino acid sequence does not differ from 

human proinsulin (Pris). Pris 5 sequence is B Chain-RR-C Chain-KR-A Chain, whereas 
Mpris' sequence is B Chain-KR-A Chain. The MPris sequence excludes the RR-C 
Chain, which is normally excised in proinsulin maturation to insulin. The C chain of 
proinsulin is an unnecessary part of in vitro production of insulin. Proinsulin folds 
c properly and forms the appropriate disulfide bonds in the absence of the C chain. The 
remaining KR motif that exists between the B chain and the A chain in MPris allows for 
mature insulin production upon cleavage with trypsin and carboxypeptidase B. This 
construct may be used for our bio polymer fusion protein. It=s codon optimization and 
amino acid se quence is ideal for mature insulin production. 

e) Our current human proinsulin-biopolymer fus ion protein contains a factor Xa proteolytic 
cut site, which serves as a cleavage point between the biopolymer and the proinsulin. 
Currently, cleavage of the polymer-proinsulin fusion protein with the factor Xa has been 
inefficient in our hands. Therefore, we repl ace this cut site with a trypsin cut site. This 
eliminates the need for the exp ensive factor Xa in processing proinsulin. Since 
proinsulin is currently processed by trypsin in the formation of mature insulin, insulin 
maturation and fusion peptide cleavage can be achieved in a singl e step with trypsin and 
carboxypeptidase B. 

f) We observed incomplete translation products in plastids when we expressed the 120mer 
gene (Guda et al. 2000). Therefore, while expressing the polymer-proinsulin fusion 
protein, we decreased the length of the polymer protein to 40mer, without losing the 
thermal responsive property. In addition, optimal codons for glycine (GGT) and valine 
(GTA), which constitute 80% of the total amino acids of the polymer, have been used. In 
all nuclear encoded genes, glycine makes up "147/1000 amino acids while in tobacco 
chloroplasts it is 129/1000. Highly expressing genes like psbA and rbcL of tobacco make 
up 192 and 190 gly/1000. Therefore, glycine may not be a limiting factor. Nuclear genes 
use 52/1000 proline as opposed to 42/1000 in chloroplasts. However, currently used 
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codon for proline (CCG) can be modified to CCA or CCT to further enhance translation. 
It is known that pathways for proline and valine are compartmentalized in chloroplasts 
(Guda et al. 2000). Also, proline is known to accumulate in chloroplasts as an 
osmopr otectant (Daniell et al. 1994). 

g) ' Codon comparison of th e CTB gene with psbA, showed 47% homology with the most 
frequent codons of the psbA gene. Codon analysis showed that 34% of the codons of 
CTB are complimentary to the tRNA population in the chloroplasts in comparison with 
51% of psbA co dons that are complimentary to the chloroplast tRNA population. 
Because of the high levels of CTB expression in transgenic chloroplasts (Henriques and 
Daniell, 2000), there will be no need to modify the CTB gene. 

(60/263,668) DNA sequence of all con structs may be determined to confirm the correct 
orientation of genes, in frame fusion, and accurate sequences in the recombinant DNA 
constructs. DNA s equencing may be performed using a Perkin Elmer ABI prism 373 DNA 
sequencing system using a ABI Prism Dye Termination Cycle Sequencing kit. Insertion sites at 
both ends may be sequence d by using primers for each strand. 

(60/263,668) Expression of all chlorop last vectors are first tested in E.coli before their 
use in tobacco transformation because of the similarity of protein synthetic machinery (Brixley et 
al. 1997). For Escherichia coli expression XL-1 Blue strain was used. E.coli may be 
tra nsformed by a standard CaCk method. 

(Update "Human Insulin") All of the resu lting vectors, containing the desired constructs, 
were used to transform both of the tobacco cultivars, Petit Havana and LAMP 605 (edible 
tobacco). T ransformation was performed using the particle bombardment method, as described. 
Bombarded leaves ar e currently being regenerated into transgenic plants under spectinomycin 

r — : — ~ ~^ " 

selection. Several clones have begun to form shoots. The clones of Petit Havana bombarded with 
the initial CTB-human proinsulin construct have regenerated large enough for us to extract 
DNA. Extracted DNA was used as a template in a PCR reaction to confirm integration of the 
cassette into t he chloroplast genome by homologous recombination. We used two primers in this 
reaction, 3P and 3M. 3P anneals with the native chloroplast genome, while 3M anneals with the 
gene for spectinomycin resistance, aadA. The 1600 bp product of this reaction is indicative of 
integration of the construct into the genome (Figure 37). This experiment demonstrated that 7 of 
the 1 1 analyzed clones were the desired chloroplast transgenic plants. Western blots are currently 
underway to confirm expression of various CTB-proinsulin fusion proteins in E. coli. Because of 
the similarity of chloroplast and E. coli protein synthetic machinery, chloroplast vectors are 
routinely tested in our lab before bombardment. Membranes have been immunoblotted with 
antibodies to both CTB and Proinsulin. Results demonstrate the presence of the desired fusion 
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(60/185.987) Optimization of fusion gene expression: It has been reported that foreign genes 
are expressed between 5% (cryl AC. crvl 1 A) and 30% (uldA) in transgenic chloroplasts (Daniell 
1999V If the expression levels of the CTB-Proinsulin or polvmer-proinsulin fusion proteins are 
low, several approaches will be used to enhance translation of these proteins. In chloroplast 
transcriptional regulation of gene expression is less important, although some modulations by 
light and developmental conditions are observed (Cohen and Mavfield. 1997V RNA and protein 
stability appear to be less important because of observation of large accumulation of foreign 
proteins (e.g. GUS up to 30% of total protein) and tpsl transcripts 16.966-fold higher than the 
highly expressing nuclear transgenic plants. Chloroplast gene expression is regulated to a large 
extent at the post-transcriptional level. For example. 5 ! UTRs are used for optional t ranslation of 
chloroplast mRNAs. Shine-Delgarno (GGAGG) sequences as well as a stem-loop structure 
located 5' adjacent to the SD sequence are used for efficient translation. A recent study has 
shown that insertion of the psbA 5' UTR downstream of the 16S rRNA promoter enhanced 
translation of a foreign gene (GUS) hundred-fold (Eibl et al. 1999V Therefore, the 85-bp 
tobacco chloroplast DNA fragment (1595 - 1680) containing 5' psbA UTR will be amplified 
using the following primers (SEP ID NOs:6 and 7, respectively): cctttaaaaagccttccattttctattt 
gccatggtaaaat cttggtttatta. This PCR product will be inserted downstream of the 16S rRNA 
promoter to enhance translation of the proinsulin fusion proteins. 

(60/185.987) Yet another approach for enhancement of translation is to optimize codon 
compositions of these fusion protein. Since both fusion proteins are expressed well in E. colL we 
expected efficient expression in chloroplasts. However, optimizing codon compositions of 
proinsulin and CTB genes to march the psbA gene could further enhance the level of translation. 
Although rbcL (RuBisCO) is the most abundant protein on earth, it is not translated as frequently 
as the psbA gene due to the extremely high turnover of the psbA gene product. The psbA gene is 
under stronger selection for increased translation efficiency and is the most abundant thvlakoid 
protein. In addition, codon usage in higher plant chloroplasts is biased towards the NNC codon 
of 2-fold degenerate groups (i.e. TTC over TTT. GAC over GAT. CAC over CAT. AAC over 
AAT. ATC over ATT. ATA etc.V This is in addition to a strong bias towards T at third position 
of 4-fold degenerate groups. There is also a context effect that should be taken into consideration 
while modifying specific codons. The 2-fold degenerate sites immediately upstream from a 
GNN codon do not show this bias towards NNC. (TTT GGA is preferred to TTC GGA while 
TTC CGT is preferred to TTT CGT TTC AGT to TTT AGT and TTC TCT to TTT TCT) (SEP 
ID NQ.8V In addition, highly expressed chloroplast genes use GNN more frequently than other 
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genes. The web site may be used optimize codon composition by 

comparing different ^species. Abundance of amino acids in chloroplasts can be taken into 
consideration (pathways compartmentalized in plastids as opposed to those that are imported into 
j?Iastids)._ 

(60/185,987) As far as the biopolymer gene is concerned, we observed incomplete 
translation products in plastids when we expressed the 120mer gene (Guda et al. 2000). 
Therefore, while ex pressing the polymer-proinsulin fusion protein, we decreased the length of 
the polymer protein to 40mer, without losing the thermal responsive property. In addition, 
optimal codons for glycine (GGT) and valine (GTA), which constitute 80% of the total amino 
acids of the polymer, have been used. In all nuclear encoded genes glycine make up 147/1000 
amino acids while in tobacco chloroplasts it is 129/1000. Highly expressing genes like psbA and 
rbcL of tobacco make up 192 and 190 gly/1000. Therefore, glycine may not be a limiting factor. 
Nuclear genes use 52/1000 proline as opposed to 42/1000 in chloroplasts. However, currently 
used codon for proline (CCG) can be modified to CCA or CCT to further enhance translation. It 
is known that pathways for proline an d valine are compartmentalized in chloroplasts (Guda et al. 
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2000). Also, proline is known to accumulate in chloroplasts as an osmoprotectant (Daniell et aL 
1994). 

(60/263,668) We have reported that foreign genes are expressed between 3% (cry2Aa2) 

and 46% {cry2Aa2 operon) in transgenic chloroplasts (Kota et al. 1999; De Cosa et al. 2001). 

Several approaches may be used to enhance translation of the recombinant proteins. In 

chloroplasts, transcriptional regulation as a bottle-neck in gene expression has been overcome by 

utilizingtthe strong cons titutive promoter of the 16s rRNA (Prrn). One advantage of Prrn is that 

it is recognized by both the chloroplast encoded RNA polymerase and the nuclear encoded 

chloroplast RNA polymerase in tobacco (Allison et al. 1996). Several investigators have utilized 

Prrn in their studies to overcome the initial hurdle of gene expression, transcription (De Cosa et 

al. 2001, Eibl et al. 1999, Staub et al. 2000). RNA stability appears to be one among the least 

problems because of observa tion of excessive accumulation of foreign transcripts, at times 

16,966-fold higher than the highly expressing nuclear transgenic plants (Lee et al. 2000). Also, 

other investiga tions regarding RNA stability in chloroplasts suggest that efforts for optimizing 

gene expression need to be addressed at the post-transcriptional level (Higgs et al. 1999, Eibl et 

al. 1999). Our work focuses on addressing protein expression post-transcriptionally. For 

example, 5' and 3* UTRs are needed for optimal translation and mRNA stablility of chloroplast 

mRNAs (Zerges 2000). Optimal ribosomal binding sites (RBS's) as well as a stem-loop 
ji j — ' . 

structure located 5= adjacent to the RBS are needed for efficient translation. A recent study has 

y j -shown that replacement of the Shine-Delgarno (GGAGG) with the psbA 5' UTR downstream of 

i — - - — •■ — — — - • -- ■ - - - j — — - - - — — — ; — * 
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the 16S rRNA promoter enhanced translation of a foreign gene (GUS) hundred-fold (Eibl et al. 
1999). Therefore, the 200-bp tobacco chloroplast DNA fragment (1680-1480) containing 5 ' 
p^bA UTR.may be used. This PCR product is inserted downstream of the 16S rRNA promote r 
to enhance translation of the recombinant proteins. 

(60/263,668) Yet another appr oach for enhancement of translation is to optimize codon 
compositions. We have compared A+T% content of all foreign genes that had been expressed in 
transgenic chloroplasts with the percentage of chloroplast expression^ We found that higher 
levels of A+T always correlated with high expression levels (see Table 2). It is also potentially 
possible to modify chloroplast protease recognition sites while modifying codons, without 
affecting their biological functions.- Therefore, optimizing codon compositions of insulin and 
polymer genes to match the psb A gene should enhance the level of translation. Although rbcL 
(RuBisCO) is the m ost abundant protein on earth, it is not translated as highly as the psbA gene 
due to the extremely high tu rnover of the psbA gene product. The psbA gene is under stronger 
selection for increased translation efficiency and is the most abundant thylakoid protein. In 
addition, the codon usag e in higher plant chloroplasts is biased towards the NNC codon of 2-fold 
degenerate groups (i.e. TTC over TTT, GAC over GAT, CAC over CAT, AAC over AAT, ATC 
over ATT, ATA etc.). This is in addition to a strong bias towards T at the third position of 4-fold 
degenerate groups. There is also a context effect that should be taken into consideration while 
modifying specific codons. The 2-fold degenerate sites immediately upstream from a GNN 
codon do not sh ow this bias towards NNC. (TTT GGA is preferred to TTC GGA while TTC 
CGT is preferred to TTT CGT, TTC AGT to TTT AGT and TTC TCT to TTT TCT, Morton, 
1993; Morton and Bernadette, 2000). In addition, highly expressed chloroplast genes use GNN 
more frequently that other genes. The disclosure of web site http://www.kazusa.or.jp/codon and 
http://www.ncbi.nlm.nih.gov may be used to -o ptimize codon composition by comparing codon 
usage of differe nt plant species' genomes and PsbA=s genes. Abundance of amino acids in 
chloroplasts and tRNA anticodons present in chloroplast may be taken into consideration. 
Optimization of polymer and proinsulin may be performed using a novel PCR approach 
(Prodromou and Pearl, 1992; Casimiro et al. 1997), which has been successfully used in our 
laboratory to optimize codon composition of other human proteins. 

(60/185,987) Bombardment and Regeneration of Chloroplast Transgenic Plants: Tobacco 
{Nicotiana tabacum var. Petit Havana) and nicotine free edible tobacco (LAMD 665, gift from 
Dr. Keith Wycoff. Planet Biotechnology) plants are grown aseptically by germination of seeds 
on MSO medium. This medium contains MS salts (4.3 g/liter), B5 vitamin mi xture (myo- 
inositol, 100 mg/liter; thiamine-HCl. 10 mg/liter nicotinic acid. 1 mg/liter; pyridoxine-HCL. 1 
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mg/liter), sucrose (30 g/Iiter) and phytagar (6 g/Iiter) at pH 5.8. Fully expanded, dark green 
leaves of about two month old plants are used for bombardment. 

(60/185,987) Leaves are placed abaxial side up on a Whatman No. 1 filter paper laying 
on the RMOP mediu m (Daniell, 1993) in standard petri plates (100x15 mm) for bombardment. 
Tungsten (1 p.m) or Gold (0.6 ^im) microprojectiles are coated with plasmid DNA (chloroplast 
vectors) and bombardments carried out with the biolistic device PDSIOOO/He (Bio-Rad) as 
described by Daniell (1997). Following bombardment, petri plates are sealed with parafilm and 
incubated at 24°C under 12 h photoperiod. Two days after bombardment, leaves are chopped . 
into small pieces of - 5 mm 2 in size and placed on the selection medium (RMOP containing 500 
jig/ml of spectinomycin dihydrochloride) with abaxial side touching the medium in deep 
(100x25 mm) petri plates (-10 pieces per plate). The regenerated spectinomycin resistant shoots 
are chopped into small pieces (~2mm 2 ).and subcloned into fresh deep petri plates (-5 pieces per 
plate) containing the same selection medium. Resistant shoots from the second culture cycle 
arbe transferred to the rooting medium (MSO medium supplemented with 1BA. 1 mg/liter and 
spectinomycin d ihydrochloride, 500 mg/liter). Rooted plants are transferred to soil and grown at 
26°C under continuous lighting conditions for further analysis. 

(60/185,987) Polymerase Chain Reaction: PGR is performed using DNA solated from control 
and transgenic plants to distinguish a) true chloroplast transformants from mutants and b) 
chloroplast transformants from nuclear transformants. Primers for testing the presence of the 
aadA gene (that confers spectinomycin resistance) in transgenic pants are landed on the aadA 
coding sequence and 16S rRNA gene (primers 1P&1M.). To test chloroplast integration of the 
insulin gene, one primer lands on the aadA gene, while another lands on the native chloroplast 
genome (primers 3P&3M) as shown in Figs. 2A and 3B. No PCR product is obtained with 
nuclear transgenic plants u sing this set of primers. The primer set (2P & 2M, in Figs. 2A and 
3B) is used to test inte gration of the entire gene cassette without internal deletion or looping out 
during homologous recombination. A similar strategy has been used successfully to confirm 
chloroplast integration of foreign genes (Daniell et al., 1998; Kota et al, 1999; Guda et al., 1999). 
This screening is essential to eliminate mutants and nuclear transformants. 

(60/185,987) Total DNA from unbombarded and transgenic plants is isolated as 
described by Edwards et al., (1991) to conduct PCR analyses in transgenic plants. PCR reactions 
are performed in a total volume of 50 jxl containing approximately 10 ng of template DNA and 1 
□M of each primer in a mixture of 300 fxM of each deoxynucleotide (dNTPs), 200 mM Tris (pH 
^.8), 100 mM KC1, 100 mM (NH^SCU, 20 mM MgSQ 4 , 1% Triton X-100, 1 mg/ml nucle ase- 
free BSA and 1 or 2 units of Tag Plus polym erase (Stratagene, La Jolla, CA). PCR is carried out 
in the Perkin Elmer's GeneAmp PCR system 2400, by subjecting the samples to 94°C for 5 min 
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and 30 cycles of 94°C for 1 min, 55°C for 1.5 min, 72°C for 1.5 or 2 min followed by a 72°C , 
step for 7 min. PCR products are analyzed by electrophoresis on 0.8% agarose gels. Chloroplast 
transgenic pla n ts containing the proinsulin gene are then moved to second round of selection to 
achieve homoplasmy. 

(60/185,987) Southern Blot Analysis: Southern blots are performed to determine the copy 

number of the introduced foreign gene per cell as well as to test homoplasmy. There are several 

thousand copies of the chloroplast genome present in each plant cell. Therefore, when foreign 

genes are inserted into the chloroplast genome, it is possibl e that some of the chloroplast 

genomes have foreign genes integrated while others remain as the wild type (heteroplasmy). 

Therefore, to ensure that only the transformed genome e xists in cells of transgenic plants 

(homoplasmy), the selection process is continued. To confirm that the wild type genome does 

not exist at the end of the selection cycle, total DNA from transgenic plants s hould be probed^ 

with the chloroplast border (flanking) sequences (the trnl-t rnA fragment, Figs. 2A and 3B). If 

wild type genomes are present (heteroplasmy), the native fragment size is observed along with 
<— .,, — ..... — — — — ■ — ■ " — ■ — ■ ■ — — — * 

transformed genomes. Presence of a large fragment (due to insertion of foreign genes within the 
flanking sequences) and absence o f the native small fragment confirms homoplasmy (Daniell et 
al., 1998; Kota et al., 1999; Guda et al., 1999). 

(60/185,987)The copy number of the integrated gene is determined by establishing 
homoplasmy form the transgenic chloroplast genome. Tobac co chloroplasts contain . 
5000—10,000 copies of their genome per cell (Daniell et al., 199 8). If only a fraction of the 
genomes are actually transformed, the copy number, by default, must be less than 10,000. By 
establishing that in the trangenics the insulin inserted transformed genome is the only one 
present, one can establish that the copy number i s 5000-10,000 per cell. This is usually 
achieved by digesting the total DNA with a suitable restriction enzyme and probing with the 
flanking sequences that enable homologous recombination into the chloroplast genome. The 
native fragment present in the control should be abs ent in the transgenics. The absence of native 
fragment proves that only the transgenic chloroplast genome is present in the cell and there is no 
native, untransformed, chloroplast genome, without the insulin gene present. This establishes 
the homoplasmic nature of the transformants, simultaneously, thereby providing an estimate of 
5000—10,000 copies of the foreign genes per cell. 

(60/185,987) Total DNA is extracted from leaves of transformed and wild type plants, 
using the CTAB procedure outlined by Rogers and Bendich (1988). Total DNA is digested with 
suitable restriction enzymes, electrophoresed on 0.7% agarose gels and transferred to nylon 
membranes (Micron Separation Inc., Westboro, MA). Probes are labeled with 32 P-dCTP using 
the random-primed procedure (Promega). Pre-hybridization and hybridization steps are carried 
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out at 42°C for 2 h and 1 6 h, respectively. Blots are soaked in a solution containing 2X SSC and 
0.5% SDS for 5 tnin followed by t ransfer to 2X SSC and 0.1% SDS solution for 15 min at room 
temperature. Then, bl ots are incubated in hybridization bottles containing 0.1X SSC and 0.5% 
SDS solution for 30 min at 37°C followed by another step at 68°C for 30 min, with gentle 
agitation. Finally, blots are briefly rinsed in 0.1X SSC solution, dried and exposed to X-ray film . 
in the dark, 

(60/185,987) Northern Blot Analysis: Northern blots are performed to test the efficiency of 
transcription of the proinsulin gene fused with CTB or polymer genes. Total RNA is isolated 
from 150 mg of frozen leaves by using the "Rneasy Plant Total RNA Isolati on Kit" (Qiagen Inc., 
Chatsworth, CA). RNA (10-40 mg) is denatured by formaldehyde treatment, separated on a 
1.2% agarose gel in the presence of formaldehyde a nd transferred to a nitrocellulose membrane 
(MSI) as described in Sambrook et al. (1989). Probe DNA (proinsulin gene coding region) is 
labeled by the random-primed method (Promega) with 32 P-dCT isotope. The blot is pre- 
hybridized, hybridized and washed as described above for southern blot analysis. Transcript 
levels are quantified by the Molecular Analyst Program using the GS-700 Imaging Densitometer 
(Bio-Rad, Hercules, CA). 

(60/185*987) Polymer-insulin fusi on protein purification, quantitation and 
characterization: Because polym er insulin fusion proteins exhibit inverse temperature 
transition properties as shown in Figs. 1 A and B, th ey are purified from transgenic plants 
essentially following the same method for polymer purification from transgenic tobacco plants 
(Zhang et al., 1996). However, a n additional step is introduced to take advantage of the 
compartmentalization of insulin polymer fusion protein within chloroplasts. Chloroplasts are 
first isolated from crude homogena te of leaves by a simple centrifugation step at 1500Xg. This 
eliminates most of the cellular organelles and proteins (Daniell at al., 1983, 1986). Then, 
chloroplasts are burst open by resuspending them in a hypotonic buffer (osmotic shock). This is 
a significant advantage because there are fewer soluble proteins inside chloroplasts when 
compared to hundreds of soluble proteins in the cytosol. Polymer extraction buffer contains 50 
mM Tris-HCl, pH 7.5, 1% 2- mecaptoethanol, 5mM EDTA and 2mM PMSF and 0.8 M NaCl. 
The homogenate is then cent rifuged at 10,000 g for 10 min (4°C), and the pellet discarded. The 
supernatant is incubated at 42°C for 30 minutes and then centrifuged immediately for 3 minutes 
at 5,000 g (room temperature). If insulin is found to be sensitive to this temperature, Tj is 
lowered by increasing salt concentration (McPherson et al., 1996). The pellet containing the 
insulin-polymer fusion protein is resuspended in the extraction buffer and incubated on ice for 10 
minutes. The mixt ure is centrifuged at 12,000 g for 10 minute (4°C). The supernatant is then 
collected and stored at -20°C. The purified polymer insulin fusion-protein is electrophoresed in 
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a SDS-PAGE eel according to Laemml (1970) and visualized bv either staining with 0.3 M 
CuC WLee et al., 1987) or transferred to nitrocellulose membrane and probed with antiserum 
raised against the polymer or insulin protein as described below. Quantification of purified 
polymer proteins may then be carried out bv densitometry. 

(60/263,668) Because polymer insulin fusion proteins exhibit inverse temperature 
transition properties as shown in Figs. 12 and 13, thev may be purified from transgenic plants 
essentially following the same method described for polymer purification from transgenic 
tobacco plants (Zhang et al. .1996). Polymer extraction buffer contains 50 mM Tris-HCl, pH, 
7.5. 1% 2-mecaptoethanol, 5mM EDTA and 2mM PMSF and 0.8 M NaCl. The homogenate is 
then centrifiiged at 10.000 g for 10 minutes (4°C), and the pellet discarded. The supernatant is 
incubated at 42°C for 30 minutes and then centrifuged immediately for 3 minutes at 5.000 g 
(room temperature). If insulin is found to be sensitive to this temperature, T L is lowered by 
increasing salt concentration (McPherson et al., 1996). The pellet containing the insulin- 
polymer fusion protein is resuspended in the extraction buffer and incubated on ice for 10 
minutes. The mixture is centrifuged at 12.000 g for 10 minutes (4°C). The supernatant is then 
collected and stored at -20°C. The purified polymer insulin fusion-protein is electrophoresed in 
a SDS-PAGE gel according to Leammli (1970) and visualized by either staining with 0.3 M 
CuCb (Lee et al. 1987) or transferred to nitrocellulose membrane and probed with antiserum 
raised against the polymer or insulin protein as described below. Quantification of purified 
polymer proteins may be carried out bv ELIS A in addition to densitometry. 

(60/185,987) After electrophoresis, proteins are transferred to a nitrocellulose membrane 
electrophoretically in 25 mM Tris, 192mM glycine, 5% methanol (pH 8.3). The filter is blocked 
with 2% dry milk in Tris-buffered saline for two hours at room temperature and stained with 
antiserum raised against the polymer AVGVP (SEP ID NO:9) (kindly provided by the University 
of Alabama at Birmingham, monoclonal facility) overnight in 2% dry milk/Tris buffered saline. 
The protein bands reacting to the antibodies are visualized using alkaline phophatase-linked 
secondary antibody and the substrates nitroblue tetrazolium and 5-bromo-4-chloro-3-indolvl- 
phosphate (Bio-Rad). Alternatively, for insulin-polvmer fusion proteins, a Mouse anti-human 
proinsulin (IgGl) monoclonal antibody is used as a primary antibody. To detect the binding of 
the primary antibody to the recombinant proinsulin. a Goat anti-mouse IgG Horseradish 
Peroxidase Labeled monoclonal antibody (HPR) is used. The substrate used for conjugation with 
HPR is 3.3', S.S^Tetramethylbenzidine. All products are available from American Oualex 
Antibodies, San Clement. CA. As a positive control, human recombinant proinsulin from Sigma 
may be used. This human recombinant proinsulin was expressed in E. coli bv a synthetic 
proinsulin gene. Quantification of purified polymer fusion proteins is carried out by densitometry 
using Scanning 
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Analysis software (BioSoft, Ferguson, MO) installed on a Macintosh LC III computer (Apple 
Computer, Cupertino, USA) with a 160-Mb hard disk operating on a System 7.1, connected by 
SCSI interface to a Relisys RELI 2412 Scanner (Relisys, Milpitas, CA). Total protein contents 
is then determined by the dye-binding assay using reagents supplied in kit fro Bio-Rad, with 
bovine serum albumin as a standard. 

(60/185,987) Characterization of CTB expression: CTB protein levels in transgenic plants are 
determined using quantitative ELISA assays. A standard curve is generated using known 
concentrations of bacterial CTB. A 96-well microtiter plate padded with 100 pi/well of bacterial 
CTB (concentrations in the range of 10 - 1000 ng) is incubated overnight at 4°C. The plate is 
washed thrice with PBST (phosphate buffered saline containing 0.05% Tween-20). The 
background is blocked by incubation in 1% bovine serum albumin (BSA) in PBS (300 1/well) at 
37°C for 2 h followed by washing 3 times with PBST. The plate is incubated in a 1:3,000 
dilution of rabbit anti-cholera toxin antibody (Sigma C-3062) (100 fil/vyell) for 2 h at 37°C, - 
followed by washing the wells three times with PBST. The plate is incubated with a 1 :8 0,000 
dilution of anti-rabbit IgG conjugated with alkaline phoshatase (100 pi/well) for 2 h at 37°C and 
washed thrice with PBST. Then, 100 )il alkaline phosphatase substrate (Sigma Fast p- 
nitrophenyl phosphate tablet in 5 ml of water is added and the reaction stopped with 1M NaOH 
(50 fil/well) when absorbancies in the mid-range of the titration reach about 2.0, or after 1 hour, 
whichever comes first. The plate is then read at 405nm. These results are used to generate a 
standard curve from which concentrations of plant protein can be extrapolated. Thus, total^ 
soluble plant protein (concentration previously determined using the Bradford assay) in 
bicarbonate buffer, pH 9.6 (15 nMNa 2 Co 3> 35mM NaHCQ 3 ) is loaded at 100 plant ul/well and 
the same procedure as above can be repeated. The absorbance values are used to determine the 
ratio of CTB protein to total soluble plant protein, using the standard curve generated previously 
and the Bradford assay results. x2l v 

(60/185,987) Inheritance of Introduced Foreign Genes: In initial tobacco transformants, some 
are allowed to self-p ollinate, whereas others nrp nqpH in rprj procal crosses with control tobacco 
(transgenics as female acceptors and pollen donors: testing for maternal inheritance). Harvested 
s eeds (Tl) are germinated on media containing spectinomycin. Achievement of homoplasmy^ 
and mode of inheritance can be classified by looking at germination results. Homoplasmy is 
indicated by totally green seedlings (Daniell et al., 1998) while heteroplasmy is displayed by^ 
variegated leaves (lack of pigmentation, Svab & Maliga, 1993). Lack of variation in chlorophyll^ 
pigmentation among progeny also underscores the absence of position effect, an artifact of 
nuclear transformation. Maternal inheritance may be demonstrated by scie transmission of 
introduced genes via seed generated on transgenic plants, regardless of pollen source (green 
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seedlings on selectiv e media). When transgenic pollerus "used for pollination of control plants, 
resultant progeny does not contain resistance to'chemical in selective media (will appear 
bleached; Svab and Maliga, 1993). Molecular analyses confirms transmission and expression of 
introduced genes, and T2 seed is generated from those confirmed*plants by the analyses 
described above. 

(60/185,987) Comparison of Current Purification with Polymer-based Purification 
Methods: It is important to compare purification methods to test yield and purity of insulin 
produced in E. coli and tobacco. (60/263,668) Three methods may be compared: a standard 
fusion protein in E.coli, polymer proinsulin fusion protein in E.coli, and polymer proinsulin 
fusion in tobacco. Polymer proinsulin fusibrtpeptide from transgenic tobacco may be purified 
by methodology described in section c),ahd Daniell (1997). E.coli purification is performed as 
follows. One liter of each pLD containing bacteria is grown in LB/ampicillin (100 jig/ml) 
overnight and the fusion protein, either polymer-proinsulin or the control fusion protein (Cowley 
and Mackin 1997), expressed. (60/185,987) One liter of pSBL containing bacteria is grown in 
LB/ampicillin (100 fig/ml) overnight and the fusion protein expressed. Cells are harvested by 
centrifugation at 5000 X g for 10 min at 4 Q C, and the bacterial pellets resuspended in 5 ml/g (wet 
wt. Bacteria) of 100 mM Tris-HCl, pH 7.3. Lysozyme is added at a concentration of 1 mg/ml 
and placed on a rotating shaker at room temperature for 1 5 min. The lysate is subjected to probe 
sonication for two cycles of 30 s on/30 s off at 4°C. Cellular debris is removed by centrifugation 
at 1000 X g for 5 min at 4°C. Insulin polymer fusion protein is purified by inverse temperature 
transition properties (Daniell et al., 1997). Alternatively, the fusion protein is purified according 
to Cowley and Mackin (1997). The supernatant is retained and centrifuged again at 27000 X g 
for 15 min at 4°C to pellet the inclusion bodies. The supernatant is discarded and the pellet 
resuspended in 1 ml/g (original wt. Bacteria) of dH 2 0, aliquoted into microcentrifuge tubes as 1 
ml fractions, and then centrifuged at 16000 X g for 5 min at 4°C. The pellets are individually 
washed with 1 ml of 100 mM Tris-HCl, pH 8 .'5, 1M urea, 1-1 Triton X-100 and again washed 
with 100 mM Tris HC1 pH8.5, 2 M urea, 2% Trinton X-100. The pellets are resuspended in 1 ml 
of dH 2 Q and transferred to a pre-weighted 30 ml Corex centrifuge tube. The sample is 
centrifuged at 15000 X g for 5 min at 4°C, and the pellet resuspended in 10 ml/g (wet wt. pellet) 
of 70% formic acid. Cyanogen bromide is added to a final concentrat ion of 400 mM and the 
sample incubated at room temperature in the dark for 1 6 h. The reaction is stopped by 
transferring the sample to a round bottom flask and removing the solvent by rotary evaporation 
at 50°C. The residue is resuspended in 20 ml/g (wet wt. pellet) of dH 2 Q, shell frozen in a dry ice 
ethanol bath, and then lyophilized. The lyophilized protein is dissolved in 20 ml/g (wet wt. 
pellet) of 500 mM Tris-HCl, pH 8.2, 7 M urea. Oxidative sulfitolysis is performed by adding 



42 



1465-PCT-OO (1577-P-00) 

sodium sulfite and sodium tetrathionate to final concentrations of 100 and 10 mM, respectively, 

and incubating at room temperature for 3 h. This reaction is then stopped by freezing on dry ice. 

(60/185.987) Purification and folding of Human Proiiisulin: The S-sulfonated material is 

applied to a 2 ml bed of Sephadex G-25 equilibrated in 20 mM Tris-HCl, pH 8.2, 7 M urea, and 

then washed with 9 vols of 7 M urea. The collected fraction is then applied to a Pharmacia 

Mono Q HR 5 7 5 column equilibrated in 20 mM Tris-HCl, pH 8.2, 7 M urea at a flow rate of 1 

ml/min. A linear gradient leading to final concentration of 0.5 M NaCl is used to elute the bound 

material. 2 min (2 ml) fractions are collected during the gradient, and protein concentration in 

each fraction determined. Purity and mole cular mass of fractions are estimated by Tricine SDS- 

PAGE (as sh own in Fig. 2), where Tricine is used as the trailing ion to allow better resolution of 

peptides in the range of 1-1000 kDa. Appropriat e fractions are pooled and applied to a 1 .6 X 20 

cm column of Sephadex G-25 (superfine) equilibrated in 5 mM ammonium acetate pH 6.8. The 

sam ple is collected based on UV absorbance and freeze-dried. The partially purified S- 

sulfonated material is resuspended in 50 mM glycine/NaOH, pH 10.5 at a final concentration of 

2 mg/ml. p-mer -captoethanol is added at a ratio of 1.5 mol per mol of cysteine S-sulfonate and 

the sample stirred at 4°C in an open container for 16 h. The sample is then analyzed by 

reversed-phase high-performance liquid chromatography (RP-HPLC) using a Vydac C4 column 

(2.2 X 150 mm) equilibrated in 4% acetonitrile and 0.1% TFA. Adsorbed peptides are eluted. 

with a linear gradient of increasing acetonitrite concentration (0.88% per min up to a maximum 

of 48%). The remaining refolded proinsulin are centrifuged at 16000 X g to remove insoluble^ 

material, and loaded onto a semi-prepar ative Vydad C 4 column (IPX 250 mm). The bound^ 

material is then eluted as described above, and the proinsulin collected and Iyophilized. 

(60/185,987) Analysis and characterization of insulin expressed in E. coli and Tobacco: The 

purified expressed proinsulin is subjected to matrix-assisted laser desorption/ionization-time of 

flight (MALDI-TCF) analysis (as described by Co wley and Mackin, 1997), using proinsulin 

from Eli Lilly as both an internal and external standard. A proteolytic digestion is performed ^ 

using Staphylococcus aureus protease V8 to determine if the disulfide bridges have formed 
■ ■ — ■ - -- — » 

correctly naturally inside chlorop lasts or by i n vitro processing. Five ^g of both the expressed 
proinsulin and Eli Lilly's proinsulin are Iyophilized and resuspended in 50 ^1 of 250 mM NaP0 4 
pH 7.8. Protease V8 is added at a ratio of 1:50 (w/w) in experimental samples and no enzyme 
added to the controls. All samples are t hen incubated overnight at 37°C, the reactions stopped 
by freezing on dry ice, and samples stored at -20°C until analyzed. The samp les are analyzed by 
RP-HPLC using a Vydac C 4 column (2.2 X 1-50 mm) equilibrated in 4% acetonitrite and 0.1% 
TFA. Bound material is then eluted using a linear gradient of increasing acetonitrile 
concentration (0.88% per min up to a maximum of 48%). 
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(60/135,987) CTB-GM1 ganglioside binding assay: A GM1-ELISA assay is perform ed as 
/ described by Arakawa et al. (1997) to determine the affinity of plant-derived CTB for GM1- 
ganglioside. The microtiter plate is coated with monosialogangliosice-GMl (Sigma G-7641) by 
incubating the plate with 100 jxl/well of GM1 (3.0 p.g/ml) in bicarbon ate buffer, pH 9.6 at 4°C 
overnight. Alternatively, the wells are coated with 100 jil/well of BSA (3.0 jig/ml) as co ntrol. 
The plates are incubated with transformed plant total soluble protein and bacteria l CTB (Sigma 
C-9903) in PBS (100 [il/well) overnight at 4°C. The remainder of the procedure is then identical 
to the ELISA described above. 

(60/185,987) Mouse feeding assays for CTB: This is performed as described by Haq et al. 
(1995). BALB/c mice, divided into groups of five anima ls each, are faste'd overnight before 
feeding them transformed edible tobacco (that tastes like spinach) exp ressing CTB, 
< untransformed edible tobacco and purified bacterial CTB. Feedings are perform ed at weekly 
intervals (0, 7, 14 days) for three weeks. Animals are observed to confirm complete 
consumption of material. On day 20, fecal and serum samples are collected from each a nimal for 
analysis of anti-CTB antibodies. Mice are bled retro-orbital ly and the samples stored at -20°C 
until assayed. Fecal samples are collected and frozen ov ernight at -70°C, lyophilized, 
resuspended in 0.8 ml PBS (pH7.2) containing 0.05% sodium azide per 15 fecal pellets, 
centrifuged at HOOxg for 5 min and the supernatant stored at -20°C until assayed. Samples are 
then serially diluted in PBS containing 0.05% Tween-20 (PBS T) and assayed for anti-CTB IgG 
in serum and anti-CTB IgA in fecal pellets by the ELISA method, as described earlier. 
(60/185,987) Assessment of diabetic symptoms in NOD mice: T he incidence of diabetic 
symptoms is compared among mice fed with control nicotine f ree edible tobacco and those that 
express the CTB-proinsulin fusion protein. Four week old fem ale NOD mice are divided into 
two groups, each, group consisting often mice. Eac h group is fed with control or transgenic 
edible tobacco (nicotine free) expressing the CTB-proinsulin fusion gene. The feeding dosage is 
determined based on the level of expression. Starting at 10 weeks of age, the mice are monitored 
on a biweekly basis with urinary glucose test strips (Clinistix and Diastix, Bayer) for 
development of diabetes. Glycosuric mice are ble d from the tail vein to check for glycemia 
using a glucose analyzer (Accu-Check, Boehringer Mannheim). Diabetes is co nfirmed by 
hyperglycemia (>250 mg/dl) for two consecutive weeks (Ma et a!., 199 7). 

. Induction of oral tolerance: (60/263,668) Four week old female NOD mice may, for example, 
■ -be purchased from J ackson Laboratory (Bar Harbor, ME) and housed at an anim al care facility. 
The mice are divided into three groups, each group consisting of ten mice. Each grou p is fed one 
of the following nicotine free edible tobacco: untransformed, express ing CTB, or expressing 
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CTB-proinsulin fusion protein. Begmning at 5 weeks of age, each mouse is fed 3 g of nicotine 
free edible tobacco once per week until reaching 9 weeks of age (a total of five feedings). 

Antibody titer: (60/263,668) At" ten' weelcs"of age, the serum and fecal material are assayed for 
anti-CTB and anti-proinsulin antibody isotypes using the ELISA method described above. 



Assessment of diabetic symptoms in NOD mice: (60/263,668) Th e incidence of diabetic 
symptoms can be co mpared among mice fed with control nicotine free edible tobacco that 
expresses CTB and t hose that express the CTB-proinsulin fusion protein. Starting at 10 weeks of 
age, the mice are monitored on a biweekly basis with urinary glucose test'strips (Clinistix and 
Diastix, Bayer) for development of diabetes. Glycosuric mice are bled from the tail vein to 
check for glycemia using a glucose analyzer (Accu-Check, Boehringer Mannheim). Diabetes is 
confirmed by hyperglycemia (>250 mg/dl) for two consecutive weeks (Ma et al. 1997). 

EXPRESSION OF HUMAN THERAPEUTIC PROTEINS 

HUMAN SERUM ALBUMIN 
HSA is a monomeric globular protein and consists o f a single, generally nonglycosylated, 
polypeptide chain of 585 amino acids (66.5 KDa and 17 disulfide bonds) with no 
postranslational modifications. It is composed of three structurally similar globular domains and 
the disulfides are positioned in repeated series of nine loop-link-loop structures centered around 
eight sequential Cys-Cys pairs. HSA is initially syndiesized as pre-pro-albumin by the liver and 
released from the endoplasmatic reticulum after removal of the aminoterminal prepeptide of 18 
amino acids. The pro-albumin is further processed in the Golgi complex where the other 6 
aminoterminal residues of the propeptide are cleaved by a serine proteinase (12). This results in 
the secretion of the mature polypeptide of 585 amino acids. HSA is encoded by two codominant 
autosomic allelic genes. HSA belongs to the multige'ne "family of proteins ' tha t include alpha- 
fetoprotein and human group-specific component (Gc) or vitamin D-binding family. HSA 
facilitates transfer of many ligands across organ circulatory interfaces such as in the liver, 
intestine, kidney and brain. In addition to blood plasma, serum albumin is also found in tissues. 
HSA accounts for about 60% of the total protein in blood serum. In the serum of human adults, 
the concentration of albumin is 40 mg/ml. 

... Medical applications of HSA: The primary function of HSA is the maintenance of 
colloid osmotic pressure (COP) within the blood vessels. Its abundance makes it an important 
determinant of the^pharmacokinetic behavior of m any drugs. Reduced synthesis of HSA can be 

■ : ' T^* -' 
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clue to advanced liver disease, impaired intestinal absorption of nutrients or poor nutritional 

intake. Increased albumin losses can be due to kidney diseases (increased glomerular 

permeability to macromolecules in the nephrotic syndrom e), intestinal diseases (protein-losing 

enteropathies) or exudative skin disorders (burns). Catabolic states such as chronic infections, 

sepsis, surgery, in testinal resection, trauma or extensive burns can also cause hypoalbuminemia.^ 

HSA is used in therapy of blood volume disorders, for example posthaemorrhagic acute 

hypovolemia or extensive burns, treatment of dehydration states, and also for cirrhotic and 

hepatic illnesses. It is also used as an additiv e in perfusion liquid for extracorporeal circulation. 

HSA is used clinically for replacing blood volume, but also has a variety of non-therapeutic uses, 

including its role as a stabilizer in formulations for other therapeutic proteins. HSA is a stabilizer 

for biological materials in nature and is used for preparing biological standards and reference 
» . — ? 

materials. Furthermore, HSA is frequently used as an experimental antigen, a cell-culture 

; 1 

constituent and a standard in clinical-chemistry tests. 

Expression Systems for HSA; The expression and purification of recombinant HSA 
from various microorganisms has been reported previously (13-17). Saccharomyces cerevisiae 
has been used to produce HSA both intracellular, requiring denaturation and refolding prior to 
analysis (18), and by secretion (19). Secreted HSA was equivalent structurally, but the 
recombinant product had lower levels of expression (recovery) and structural heterogeneity 
compared to the blood derived protein (20). HSA was also expressed in Kluyveromyces lactis, a 
yeast with good secretary properties achieving 1 g/liter in fed batch cultures (21). Ohtani et al 
(22) developed a HSA expression system using Pichia pastoris and established a purification 
method obtaining recombinant protein with similar levels of purity and properties as the human 
protein. In Bacillus subtilis, HSA could be secreted using bacterial signal peptides (15). HSA 
production in E. coli was successful but required additional in vitro processing with trypsin to 
yield the mature protein (14). Sijmons et al. (23) expressed HSA in transgenic potato and 
tobacco plants. Fusion of HSA to the plant PR-S presequence resulted in cleavage of the 
presequence at its natural site and secretion of correctly processed HSA, that was 
indistinguishable from the authentic human protein. The expression was 0.014% of the total 
soluble protein. However, none of these methods have been exploited commercially. 

- Challenges in commercial production of HSA: Albumin is currently obtained by 
protein fractionation from plasma and is the world's most used intravenous protein, estimated at 
around 500 metric tons per year. Albumin is administered by intravenous injection of solutions 
containing 20% of albumin. The average dosage of albumin for each patient varies between 20- 
40 grams/day. The consumption of albumin is around 700 kilograms per million habitants per 
year. In addition to the high cost, HSA has the risk of transmitting diseases as with other blood- 
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derivative products. The price of albumin is about $3.7/g. Thus, the market of this protein 
approximately amounts to $ 2,600,000 per million people per year (0.7 billion dollars per year in 
USA). Because of the high cost of albumin, synthetic macromolecules (like dextrans) are used to 
increase plasma colloidosmotic pressure. 

Commercial HSA is mainly prepared from human plasma. This source, hardly meets the_ 
requirements of the world market. The availability of human plasma is limited and careful heat 
treatment of the product prepared must be performed to avoid potential contamination of the 
product by hepatitis, HIV and other viruses. The costs of HSA extraction from blood are very 
high. In order to meet the demands of the large albumin market with a safe product at a low cost, 
innovative production systems are needed. Plant biotechnology offers promise of obtaining safe 
and cheap proteins to be used to treat human diseases. 

INTERFERON ALPHA 

Interferons (IFNs) constitute a heterogeneous family of cytokines with antiviral, 
antigrowth, and immunomodulatory pro perties (24-26). Type I IFNs are acid-stable and 
constitute the first line of defence against viruses, both by displaying direct antiviral eff ects and 
'by interacting with the cytokine cascade and the immune system. Their function is to induce 
regulation of growth and differentiation of T cells. The h uman IFN-a family consists of at least 
22 intronless genes, 9 of whic h are pseudogenes and 13 expressed genes (subtypes) (27). Human 
IFN-a genes encode proteins of 188 or 189 amino acids. The first 23 amino acids constitute a 
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signal peptide, and the other 165 or 166 amino acids form the mature protein. IFN-a subtypes 
show 78-94 % homology at the nucleotide level. Presence of two disulfide bonds between Cys- 
l:Cys-99 and Cys-29:C ysl39 is conserved among all IFN-a species (28). Human IFN-a genes 
are expressed constitutively in organs of normal individuals (29,30). Individual IFN-a genes are 
differently expressed depending on the stimulus and they show restricted cell type expression 
(31). Although all IFN-a subtypes bind to a common receptor (32), several reports suggest that 
they show quantitatively distinct patterns of ant iviral, growth inhibitory and immunomodulatory 
activities (33). IFN-a8 and IFN-a5 seem to have the greatest antiviral activity in liver tumour 
cells HuH7 (33). IFN-a5 has, at least, the same antiviral ac tivity as IFN-a2 in in vitro 
experiments (unpublished data in Dr. Prieto's lab). I t has been shown recently that IFN-a5 is the 
sole IFN-a subtype expressed in normal liver tissue (34). IFN-a5 expression in patients with 
chronic hepatitis C is reduced in the liver (34) an d induced in mononuclear cells (35). 

Interferons are mainly known for their antiviral activities against a wide spectrum of 
viruses but also for their protective role against some non-viral pathogens. They are potent 
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immunomodulators, possess direct antiproliferative activities and are cytoto xic or cytostatic for a 

number of different tumour cell typ es. IFN-a is mainly employed as a standard therapy for hairy 

cell leukaemia, metastasizing carcinoma and AIDS-asso ciated angiogenic tumours of mixed 

cellularity known as kaposi sarcomas. It is also active against a number of other tumours and 

viral infections. For example, it is the current approved therapy for chronic viral hepatitis B 

(CHB) and C (CHC). The IFN-a subtype used for chronic viral hepatitis is IFN-a2. About 40% 

of patients with CHB and about 25% of patients with CHC respond to this therapy with sustained 

viral clearance. The usual doses of IFN-a are 5-10 MU (subcutaneous injection) three days per 

week for 4-6 months for CHB and 3 MU three days per week for 12 m onths for CHC. Three MU 

of IFNa2 represent approximately 15 \ig of recombin ant protein. The response rate in patients 

with chronic hepatitis C can be increased by combining IFN-a2 and ribavirin. This combination 

therapy, which considerably increases the cost of the therapy and causes some additional side 

effects, results in sustained biochemical and virological remission in about 40-50% of cases. 

Recent data suggest that pegilated interferon in weekly doses of 180 \xg can also increase the 

sustained response rate to about 40%. IFN-a5 is the only IFN-a subtype expressed in liver; this 

expression is reduced in patients with CHC and IFN-a5 seems to have one of the highest 

antiviral activity in liver tumour cells (see above). An international patent to use IFN-a5 has 

been filed by Prieto's group to facilitate commercial development (36). 

Human interferons are currently prepared in microbial systems via recombinant DNA 
■ r : ! — ■ : : 

technology in amounts which cannot be isolated from natural sources (leukocytes, fibroblasts, 
lymphocytes). Different recombinant interferon-q genes have been cloned and expressed in E. 
coli (37a,b) or yeast (38) by several groups. Gene rally, the synthesized protein is not correctly 
folded due to the lack of disulfide bridges and therefore, it remains insoluble in inclusion bodies 
that need to be splubilized and refolded to obtain the active interferon (39,40). One of the most 
efficient methods of interferon-a expression has been published recently by Babu et al. (41). In 
this method, E. coli cells transformed with interferon vectors (regulated by temperature inducible 
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promoters) were g rown in high cell density cultures; this resulted in the production of 4 g 
interferon-ct/liter of culture. Expression resulted exclusively in the form of insoluble inclusion 
bodies which were solubilized under denaturing conditions, refolded and purified to near 
homogeneity. The yield of purified interferon-a was approximately 300mg/l of culture. 
Expression in plants via the nuclear genome has not been very successful. Smirnov et al. (42) 
obtained transformed tobacco plants with Agrobacterium tumefaciens using the interferon-D 
gene under 35S CaMV promoter but the expression level was very low. Eldelbaum et al. (43) 
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showed tobacco nuclear transformation with Interferon-D and the expression level- detected was 
0.000017% of fres h weight-^ "'^ "" °U , . * \ 

The number of subjeStfe irilfected with hepatitis C virus (HCV) is estimated to be 120 
million (5 million in Euro pe and 4 million in USA). Seventy per cent of the infected people have 
abnormal liver function and about one third of these have severe viral hepatitis or cirrhosis. It 
might be estimated however that there are about 10,000-15,000 cases of chronic infection with 
hepatitis B virus (HBV) in Europ e, a slightly lower number of cases in USA. In Asia the 
prevalence of chronic HCV and HBV i nfection is very high (about 110 million of people are 
infected by HCV and about 150 millions are in fected by HBV). In Africa HCV infection is very 
prevalent. Since unremitti ng chronic viral hepatitis leads to liver cirrhosis' and eventually to liver 
cancer, the hig h prevalence of HBV and HCV infection in Asia and Africa accounts for their 
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very high incidence of hepatocellular carcinoma. Based on these data, the need for IFN-q is 
large. IFN-a2 is currently produced in microorganisms by a number of companies and the price 
of 3 MU (1 5 fig) of recombinant protein in the western market is about $25. Thus, the cost of one 
year IFN-cc2 therapy is about $ 4,000 per patient. This price makes this product unavailable for 
most of the patients in the world suffering from chronic viral hepatitis. Clearly methods to 
produce less expensive recombinant proteins via plant biotechnology innovations would be 
crucial to make antiviral therapy widely available. Besides, if IFN-a5 is more efficient than IFN- 
a2, lower doses may be required. 



INSULIN-LIKE GROWTH FACTOR-I (IGF-I) 
The Insulin-like Gro wth Factor protein, IGF-I, is an anabolic hormone with a complex 
maturation process. A single IGF-I gene is transcribed into several mRNAs by alternative 
splicing and use of different transcription initiation sites (44-46). Depending on the choice of 
splicing, two immature proteins are produced : IGF-IA, expressed in several tissues and IGF-IB, 
mostly expressed in liver (45). Both pre-proteins produce the same matu re protein. A and B 
Jmmature forms have different lengths and composition, as their termini are modified post- 
translationally by glycosylation. However, these ends are processed in the last step of 
maturation. Mature IGF-I protein is secreted,- not glycosylated and has three disulfide bonds, 70 
amino acids and a molecula r weight of 7.6 kD (47-49). Physiologically, IGF-I expression is 
induced by growth hormone (GH). Actually, the knock out o f IGF-I in mice has shown that 
several functions attributed originally to GH are in fact mediate d by IGF-I. GH production by 
^adenohypofisis is repressed by feed-back inhibition of IGF-I. GH induces IGF-I synthesis in 
^different tissues, but mostly in liver, where 90% of IGF-I is produced (48). The IGF-I receptor is 
^expressed in different tissues. It is formed by two polypeptides: alpha that interacts with IGF-I 
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and beta involved in signal transduction and, also present-in therrinstiliri 1 rbdfeptor (50,51). Thus, 

IGF-I and insulin activation are similar; ^ . *" 

IGF-I is a potent multifunctional anabolic hormone produced in the liver upon 

stimulation by growth hormone (GH). In liver cirrhosis the reduction of receptors for G H in 

hepatocytes and the diminished synthesis of the liver parenchyma cause a progre ssive fall of 

serum IGF-I levels. Patients with liver cirrhosis have a number of systemic derra ngements such 

as muscle atrophy, osteopenia, hypogonadism, protein-calorie malnutrition which could be 

related to reduced levels of circulating IGF-I. Recent studies from Prieto's laboratory have 

r — — - — — — — ' ; : 

^demonstrated that treatments with low doses of IGF-I induce significant improvements in 

nutritional status (52), intestinal absorption (53-55), osteopenia (56), hypogonadism (57) and 

Jiver function (58) in rats with experimental liver cirrhosis. These data support that IGF-I 

deficiency plays a pathogenic role in several systemic complications occurring in liver cirrhosis. 

The liver can be considered as an endocrine gland synthesising a hormone such as IGF-I with 

important physiological functions. Thus liver cirrhosis should, be viewed as a disease 

accompanied by a hormone deficiency syndrome for which replacement therapy with IGF-I is 

warranted. Clinical studies are in progress to ascertain the role of IGF-I in the management of 

cirrhotic patients. IGF-I is also being currently used for Laron dwarfism treatment. These 

patients lack liver GH receptor so IGF-I is not expressed (59). Also IGF-I, acting as a 

hy poglycemias, is given together with insulin in diabetes mellitus (60,61). Anabolic effects of 

JGF-I are used in osteoporosis treatment (62,63) hypercatabolism and starvation due to burning 

and HIV infection (64,65). Unpublished studies indicate that IGF-I could also be used in patients 

with articular degenerative disease (osteoarthritis). 

The potency of IGF-I has encouraged a great number of scientists to try IGF-I expression 

in various microorganisms due to the small amount present in human plasma. Production of IGF- 

I in yeast was shown to have several disadvantages like low fermentation yields and risks of 

obtaining undesirable glycosylation in these molecules (66). Expression in bacteria has been the 

most successful approach, either as a secreted form fused to protein leader sequences (67) or 

\ - 

JFused to a solubilized affinity fusion protein (68). In addition, IGF-I has been produced as 
insoluble inclusion bodies fused to protective polypeptides (69). Sun-Ok Kim and Young Lee 
(70a) expressed IG F-I as a truncated beta-galactosidase fusion protein. The final purification 
yielded approximately 5 mg of IGF-I having native conformation per liter of bacterial culture. 
IGF-I has also been expressed in animals. Zinovieva et al. (70b) reported an expression of 0.543 
mg/ml in rabbit milk. 

IGF-I circulates in plasma in a fairly high concentration varying between 120-400 ng/ml. 
In cirrhotic patients the values of IGF-I fall to 20 ng/ml and frequently to undetectable levels. 
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Replacement therapy with IGF-I in liver cirrhosis requires administration of 1.5-2 mg per day for 
each patient. Thus, every cirrhotic patient will consume about 600 mg per year. IGF-I is 
currently produced in bacteria (71). The high amou nt of recombinant protein needed for IGF-I 
replacement therapy in patients with liver cirrhosis will make this treatment exceedingly 
expensive if new methods for cheap production of recombinant proteins are not developed! 
Resides, as described above, IGF-I is used in treatment of dwarfism, diabetes, osteoporos is, 
starvation and hypercatabolism. IGF-I use in osteoarthritis is currently being investigated. 
Again, plant biotechnology could provide a solution to make economically feasible the 
application of IGF-I therapy to all these patients. 



SUMMARY OF THE INVENTION 

The present invention develops recombinant DNA vectors for enhanced expression of 
hfriman serum albumin, in sulin-like growth factor I, and interferon-a 2 and 5, via 
chloroplast genomes of tobacco, 

optimizes processing and purification of pharmaceutical proteins using chloroplast vectors in E. 
coli, and 

obtains transgenic tobacco plants. 

T he transgenic expression of proteins or fusion proteins is characterized using molecular ancj 

biochemical methods in c hloroplasts. 

Existing or modified methods of purification are employed on transgenic leaves. 

Mendelian or maternal inheritance of transgenic plants is analyzed. 

Large scale purification of therapeutic proteins from transgenic tobacco and comparison of. 

' — — L-j ■• 

current purification methods in E.coli or yeast is performed, and 

— - — . 1 

natural refolding in chloroplasts is compared with existing in vitro processing methods; 

Comparison/characterization (yield and purity) of therapeutic proteins produced in yeast or 
" " 1,1 ^— , 

E.coli with transgenic tobacco chloroplasts is performed, as are 

I n vitro and in vivo (pre-clini cal trials) studies of protein biofunctionality. 

DETAILED DESCRIPTION OF THE TNVF.NTTON 
Chloroplast genetic engineering: When the concept of chloroplast genetic engineering 
was developed (72,73), it was possible to intro duce isolated intact chloroplasts into protoplasts 
and regenerate transgenic plants (74). Therefore, early investigations on chloroplast 
transformation focu sed on the development of in organello systems using intact chloroplasts 
capable of efficient and prolonged transcription and translation (75-77) and expression of foreign 
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genes in isolated chloroplasts ( 78), However, after the discovery of the gene gun as a 
transformation device {19), it was p ossible to transform plant chloroplasts without the use of 
isolated plastids and protoplasts. Chloroplast genetic engineering was accomplished in several 
phases. Transient expression of foreign genes in plastids of dicots (80,81) was followed by such 
studies in monocots (82). Unique to the chloroplast genetic engineering is the development of a 
foreign gene expression system using autonomously replicatin g chloroplast expression vectors 
(80). Stable integration of a selectable marker gene into the tobacco chloroplast genome (83) was 
also accomplished using the gene gun. However, use ful genes conferring valuable traits via 
chloroplast genetic engineering have been demonstrated only recently. For example, plant^ 
resistant to B.t. sensitive insects were obtained by integrating the cry I Ac gene into the tobacco 
chloroplast genome (84). Plants resistant to B.t. resistan t insects (up to 40,000 fold) were 
obtained by hyper-expressio n of the cry2A gene within the tobacco chloroplast genome (85). 
Plants have also been genetically engineered via the chloroplast genome to confer herbicide 
resistance and the introduced foreign genes were maternally inherited, overcoming the problem 
of out-cross with weeds (86). Chloroplast genetic engineering technology is currently b eing 
applied to other useful crops (73,87). 

A remarkable feature of chl oroplast genetic engineering is the observation of 
exceptionally large accumulation of for eign proteins in transgenic plants, as much as 46% of 
CRY protein in total soluble protein, e ven in bleached old leaves (3). Stable expression of a 
pharmaceutical protein in chlor oplasts was first reported for GVGVP, a protein based polymer 
with varied medical applications (such as the prevention of post- surgical adhesions and scars, 
wound coverings , artificial pericardia, tissue reconstruction and programmed drug delivery (88)). 
Subsequent ly, expression of the human somatotropin via the tobacco chloroplast genome (9) to 
liign levels (/Yo ot toxai soiuoie proLeinJ WaS dl)ierve%lhe following investigations that are in 
progress in the Darn ell laboratory illustrate the power ot this technology to express small 
^ peptides, entire operons, vaccines that re quire oligomeric proteins with stable disulfide bridges 
and monoclonals that require assembly of heavy/li ght chains via chaperonins. 

Engineering novel pathways via the chloroplast: In plant and animal cells, nuclear 
mRNAs are translated monocistronically. This poses a serious problem when engineering 
multiple genes in plants (91). Therefore, in order to express th e polyhydroxybutyrate polymer or 
Guy's 13 antibody, single genes were first introduced into individual transgenic plants, then 
these plants were back-crosse d to reconstitute the entire pathway or the complete protein (92,93). 
Similarly, in a seven y ear long effort, Ye et al. (81) recently introduced a set of three genes for a 
short biosyntheticpathway that resulted in. P-carotene expression in rice. In contrast, most 
chloroplast genes of higher plants are cotranscribed (91). Expression of polycistrons via the 
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chloroplast genome provides a unique opportunity to express entire pathways in a single 

transformat ion event. The Bacillus thuringiensis (Bt) crylAal operon has recently been used as 

a model system to demonstrate operon expression and crystal formation via the chloroplast 

genome (3). Cry2Aa2 is the distal gene of a three-gene operon. The ^//immediately upstream of 

cry2Aa2 codes for a putative chaperonin that facilitates the folding of cry2Aa2 (and other 

— - — — — ^ 

proteins) to form proteolytically stable cuboidal crystals (94). 

Therefore, the cry2Aa2 bacterial operon was expressed in tobacco chloroplasts to test th^ 

resultant transgenic plants for increased expression and improved persistence of the accumulate^ 

insecticidal protein(s). Stable foreign gene inte gration was confirmed by PCR and Southern blot 

analysis in T 0 and Ti transgenic plants. Cry2Aa2 operon derived protein accumulated at 45.3% 

of the total soluble protein in mature leaves and remained stable even in old bleached leaves 

(46.1%)(Figure 15). This is the highest level of foreign gene expression ever reported in 

transgenic plants. Exceedingly difficult to control insects (10-day old cotton bollworm, 

beetarmy worm) were killed 100% after consuming transgenic leaves. Electron micrographs 

showed the presence of the insecticidal protein folded into cuboidal crystals similar in shape to 

Cry2Aa2 crystals observed in Bacillus thuringiensis (Figure 16). In contrast to currently 

marketed transgen ic plants with soluble CRY proteins, folded protoxin crystals will be processed 

" — ■ — ' / 

only by target insects that have alkaline gut pH; this approach should improve safety of Bt 

transgenic plants. Abs ence of insecticidal proteins in transgenic pollen eliminates toxicity to non- 

target insects via pollen. In addition to these environmentally friendly approaches, this 

observation should serve as a model system for large-scale production of foreign proteins within 

chloroplasts in a folded configuration enhancing their stability and facilitating single step 

purification. This is the first demonstration of expression of a bacterial operon in transgenic 

plants and opens the door to engineer novel pathways in plants in a single transformation event. 

Engineering small peptides via the chloroplast genome: It is common knowledge that 

the medical community has been fighting a vigorous battle against drug resistant pathogenic 

bacteria for years. Cationic antibacterial peptides from mammals, amphibians and insects have . 

gained more attention over the last decade (95). Key features of these cationic peptides are a net 

positive charge, an affinity for negatively-charged prokaryotic membrane phospholipids over 

neutral-charged eukaryotic membranes and- the ability to form aggregates that disrupt the*< 

bacterial membrane (96). 

There are three major peptides w ith ot-helical structures, cecropin from Hyalophora 

cecropia (giant silk moth), magainins from Xenopus laevis (African frog) and defensins from 

mammalian neutrophils. Magainin and its analogues have been studied as a broad-spectrum 

topical agent, a systemic antibiotic; a wound-healing stimulant; and an anticancer agent (97). We 
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have recently obs erved that a synthetic lytic peptide (MS 1-99, 22 amino acids) can be 

successfully expressed in toba cco chloroplast (98). The peptide retained its lytic activity against 

the phytopathogenic bac teria Pseudomonas syringae and multidrug resistant human pathogen, 

Pseudomonas aeruginosa. The anti-microbial peptide (AMP) used in this study was an 

amphipathi c alpha-helix molecule that has an affinity for negatively charged phospholipids 

commonly found in the outer-membrane of bacteria. Upon contact with these membranes, 

. " — ■ — — * 

individual peptides aggregate to form pore s in the membrane, resulting in bacterial lysis. 

Because of the concentration d ependent action of the AMP, it was expressed via the chloroplast 

genome to accomplish high dose delivery at the point of infection. PGR products and Southern 

blots confirmed chloroplast integration of the foreign genes and homoplasmy. Growth and 

development of the transgenic plants was unaffected by hyper-expression of the AMP within 

chloroplasts. In vitro assays with To and Ti plants confirmed that the AMP was expressed at 

high levels (21.5 to 43% of the total soluble protein) and retained biological activity against 

Pseudomonas syringae, a major plant pathogen. In situ assays resulted in intense areas of 

necrosis around the point of infection in control leaves, while transformed leaves showed no 

signs of necrosis (200-800 jag of AMP at the site of infection)(Figure 17). Ti in vitro assays 

against Pseudomonas aeruginosa (a multi-drug resistant human pathogen) displayed a 96% 

inhibition of growth (Figure 18). These results give a new option in the battle against 

* phytopathogenic and drug-resistant human pathogenic bacteria. Small peptides (like insulin) are 

degra ded in most organisms. However, stability of this AMP in chloroplasts opens up this 

compartment for expression of hormones and other small peptides. 



Expression of cholera toxin p subunit oligomers as a vaccine in chloroplasts 

Vibrio cholerae^ which causes acute watery diarrhea by colonizing the small intestine and 
producing the enterotoxin, cholera toxin (CT). Cholera toxin is a hexameric AB5 protein* 1 
consisting of one toxic 27kDa A subunit having ADP ribosyl transferase activity and a nontoxic 
pentamer of 1 1.6 kDa B subunits (CTB) that binds to the A subunit and facilitates its entry into 
the intestinal epithelial cells. CTB when administered orally (99) is a potent mucosal immunogen 
which can neutralize the toxicity of the CT holotoxin b y preventing it from binding to the 
intestinal cells (100). This is believed to be a r esult of it binding to eukaryotic cell surfaces via 
the Gty[\ gangliosides, receptors present on the intestinal epithelial surface, thus eliciting a 
mucosal immune re sponse to pathogens (101) and enhancing the immune response when 
chemically coupl ed to other antigens ( 1 02- 1 05). 

Cholera toxin (CTB) has previously been expressed in nuclear transgenic plants at levels 
of 0.01 (leaves) to 0.3% (tubers) of the total soluble protein. To increase expression levels, we 
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engineered the chloroplast genome to express th e CTB gene (10). We observed expression of 
oligomeric CTB at leve ls of 4-5% of total soluble plant protein (Figure 19A). PCR and Southern 
Blot analyses confirmed stable integratio n of the CTB gene into the chloroplast genome. 
Western blot analysis showed that transgenic chloroplast expressed CTB was antigenically 
identical to commercially available purified CTB antigen (F igure 20). Also, Gjyn-ganglioside 
binding assays confirm that chloroplast synthesized CTB binds to the intestinal membrane 
receptor of cholera toxin (Figure 19B). Transgeni c tobacco plants were morphologically 
indistinguishable from untransformed plants and the introduced gene was found to be stably 
. inherited in the subsequent genera tion as confirmed by PCR and Southern Blot analyses. The T 
increased production of an efficient tra nsmucosal carrier molecule and delivery system, like 
CTB, in chloroplasts of plants makes plant ba sed oral vaccines and fusion proteins with CTB 
needing oral administration, a much more feasible approach.ljThis also esta blishes unequivocally 
that chloroplasts are capable of forming disulfide bridges to assemble foreign proteins\ ^ ' 

Expression and assembly o f monoclonals in transgenic chloroplasts 

( ■ ■ ' ■ - ■ • ; — 

Dental caries (cavities) is probably the most prevalent' disease of humankind. 
Colonization of teeth by S. mutans is the single most important risk factor in the development of 
d ental caries. S. mutans is a non-motile, gram positive coccus. I t colonizes tooth surfaces and 
synthesizes glucans (insoluble polysaccharide) and fructans from sucrose using the enzymes 
glucosyltransferase and fructosyltransferase respectively (106 a). The glucans play an important 
role by allowing the bacteri um to adhere to the smooth tooth surfaces. After its adherence, the 

bacterium ferments sucrose and produces lactic acid. Lactic acid dissolves the minerals of the 

tooth, producing a cavity. ' " " " 

A topical monoclonal antibody therapy to prevent adherence of S. mutans to teeth has 
recently been developed. The incidenc e of cariogenic bacteria (in humans and animals) and 
dental caries (in animals) was dram atically reduced for periods of up to two years after the 
c essation of the antibody therapy. No adverse events w ere detected either in the exposed animals 
or in human volunteers (106b). The annual requirement for this a ntibody in the US alone may 
eventually exceed 1 metric ton. Therefore, this antibody w as expressed via the chloroplast 
genome to achieve higher levels of expression an d proper folding (11). The integration of 
antibody genes into the chloroplast genome was c onfirmed by PCR and Southern blot analysis. V 
The expression of both heavy a nd light chains was confirmed by western blot analysis under 
reducing conditions (Figure 21A,B). The expression o f fully assembled antibody was confirmed 
by western blot analysis under non-reducing conditions (Figure 21C). This is the first report of 
successful assembly of a multi-subunit human protein in transgenic chloroplasts. Production of 
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monoclonal antibodies at agricultural level should reduce their cost and create new applications 
of monoclonal antibodies. a 

HUMAN SERUM ALBUMIN 

Nuclear transformation 

The human HSA cDNA was cloned from human liver cells and the patatin promoter^ 
(whose expression is tuber specific (107)) fused along with the leader sequence of PIN II 
(proteinase II inhibitor potato transit peptide that directs HSA to the apoplast (108)). Leaf discs 
of Desiree and Kennebec potato plants were transformed using Agrobacterium tumefaciens. A 
total of 98 transgenic Desiree clones and 30 Kennebec clones were tested by PCR and western 
blots. Western blots showed that the recombinant albumin (rHSA) had been properly cieavea oy 
the proteinase II inhibitor transit peptide (Figure 22). Expression levels of both cultivars were 1 

very different among all transgenic clones as expected (Figure 23), probably because of position 
* - 

effects and gene silencing (89,90). The population distribution was similar in both cultivars: 
^ i 

majority of transgenic clones showed expression levels between 0.04 and 0.06% of rHSA in the 

total soluble protein. The maximum recombinant HSA amou nt expiessed was 0.2%. Between — 
one and five T-DNA insertions per tetraploid genome were observed in these clones. FlUIllS with — 
higher protein expression were always clones with several copies of the HSA gene . Levels ot 
mRNA were analyzed by Northern blots. There was a correlation between transcript levels and 
recombinant albumin accumulation in transgenic tubers. The N-termin al sequence showed 
proper cleavage of the transit peptide and the amin o terminal sequence between recombinant and 
human HSA was id entical. Inhibition of patatin expression using the antisense technology did 
not improve the amount of rHSA. Average expression level among 29 trans genic plants was 
0.012% of total soluble protein, with a maximum expression of 0.1%.. 

Transformation of the tobacco chloropl ast genome was initiated for hyperexpression of 
HSA. The codon composition is ideal for chloroplast expression and no changes in nucleotide 
sequences were necessary. For all the constructs pLD vector was used. Several vectors were^ 
designed to optimize HSA expression. All these contained ATG as the first amino acid of the 
mature protein. 
RBS-ATG-HSA 

The first vector i ncluded the gene that codes for the mature HSA plus an additional ATG 
as a translation initiation codon. We included the ATG in one of the primers of the PCR, 5 
nucleotides downstream of the chloroplast preferred RBS sequence GGAGG. The cDNA 
sequence of the mature HSA (cloned in Dr. Mingo-Castel's laboratory) was used as a template . 
The PCR product was cloned into PCR 2.1 vector, excised as an Eco RI-NotI fragment and 
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introduced into the pLD vector. (Update "Human Therapeutic Proteins") The vector includes tji e 

chloroplast preferred Ribosome Binding Site (RBS) sequence GGAGG. 
■ ■ ■ — — > 

5'UTRpsbA- ATG-HSA 

The 200 bp tobacco chloroplast DNA fragment containing the 5' psbA UTR was 

, amplified using PCR and tobacco DNA as template. The fragment was cloned into PCR 2.1 
vector, excised EcoRI-Ncol fragment was inserted at the Ncol si te of the ATG-HSA and finally 
inserted into the pLD vector as an EcoRI-NotI fragment downstream of the 16S rRNA promoter 
t o enhance translation of the protein. (Update "Human Therapeutic Proteins") HSA was cloned 
downstream of the psbA 5* UTR including the promoter and untranslated region, which has been 
shown to enhance translation . 
BtORFl +2- ATG-HSA 

ORF1 and ORJF2 of the Bt Cry2Aa2 operon were amplified in a PCR using the complete^ 
operon as a templa te. The fragment was cloned into PCR 2. 1 vector, excised as an EcoRI-EcoRV 
frapmpnt. inserted at EcoRV site with the ATG-HSA sequence and introduced into the pLD 
vector as a n EcoRT-NotI fragment. The ORF1 and ORF2 were fused upstream of the ATG-HSA. 
(Update "Human Therapeutic Proteins") This introduced the putative chaperonin (ORF2) of thg 

,B-t. cry2Aa2 operon upstream of the HSA gene, which has been shown to fould foreign proteins 
and form crystals, aiding in protein stability and purificatio n. 
BtORFl+2-5'UTRpsbA-ATG-HSA 

The 5'UTRpsbA was introduced in the above vector upstream of the HSA at the EcoRV- 
Ncol site. 

Because of the similarity of protein synthetic machine ry (109), expression of all 
chloroplast vectors was first tested in E.coli before their use in tobacco transformation. Different 
levels of expression were obtained in E. coli depending on the construct (Figure 24). Using the 
psbA 5 T UTR and the ORF1 and ORF2 of the cry2Aa2 operon, we obtained higher l evels of 
expressi on than using only the RBS. We have observed in previous experiments that HSA in E. 
coli is completely insoluble (as is shown in ref 14), probably due to an improper folding resulting 
from the absence of disulfide bonds. This is the reason why the p rotein is precipitated in the gel 
(Figure 24). Different polypeptide sizes were observed, probably due to incomplete translation. 
Assuming that E. coli and chloroplast have similar protein synthesis machinery, one could expect 
different levels of expression in transgenic tobacco chloroplasts depending on the regulatory 
sequences, with the advantage that disulfide bonds are forme d in chloroplasts (9). These three 
vectors were bombarded into toba cco leaves- via particle bombardment (110) and after 4 weeks 
small shoots appeared as a r esult of independent transformation events. They all were tested by 
PCR to check integration in the chloroplast genome as shown in Figs. 10A and B. The positive 
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clones were transferred to pots. Transgenic leaves analyzed by western blots showed differen t 
levels of expression depending on the 5 1 region used in the transformation vector. Maximum 
levels were observed in the plants transformed with the HSA preceded by the 5' UTR of the 
psbA gene. Quantification of the HSA and molecular analysis of these transformants are in 
t progress. 

( Update "Human Therapeutic Proteins") All chloroplast vectors were bombarded into 
tnhar.r.n leaves via particle bombardment and after 4 weeks shoots appeared as a result o f 
independent transformation events. All shoots were tested by PCR to verify in tegration into the 
phloroplast genome. The positive clones were passed through a second round of selection to 
achieve homoplasmy and transferred to pots. The phenotype of these plants was completel y 
normal. Transgenic leaves analyzed by western blots showed consistently the same patterh of 
expression depending on the 5' region used in the transformation vector (see Figure 38). 
Maximum levels of expression were observed in the plants transformed with the HSA preceded 
hy the psbA 5' UTR and promoter. Molecular characterization of the firs t generation is in 
progress. Southern blots of several clones showed homoplasmy in all transgenic lines except one ^ 
(see clone # 6, Figure 39). Northern blots showed different length of transcripts depending on the 
5' regulatory region that was inserted upstream of the HSA gene (see Figure 40). The mo st 
abundant transcript was the monocistron in plants with the 5'psbA promoter upstream of the 
HSA gene. Polycistrons of different length were observed based on the number of promoters 
used in each construct and differential processing. 

We have observed different levels of HSA in ELISA depending on the extraction buffer 
used and further optimization of this procedure is in progress. With incomplete extraction 
procedures, the highest HSA level of expression in plants transformed with pLD-5 f psbA-HSA 
was up to 11.1% of total soluble protein; this is more than 100 fold the expression observed with 
other two constructs (see Figure 41). Because we have routinely observed high levels of foreign 
gene expression with other two vectors, we anticipate that t he actual level of HSA expression in 
p LD-5'psbA-HSA may exceed 50% of total soluble protein. Since the expression of HSA under 
the 5'psbA control is light dependent, the time of the tissue harvest for expression studies i| 
important. Such changes in HSA accumulation are currently being investigated using ELISA and 
Northerns. 

Charar.teri7.atinn of HSA from transg enic chloroplasts for proper folding, disulfide bond 
formati on and functionality is in progress. The stromal pH within chloroplasts and the presence, 
nf hnt h thiorpHnxin nnH disulfide isomerase systems provide optimal conditions for proper 
folding and disulfide bond formation within folded HSA. 
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JBNTERFERON-a5 

Interferon-a5 has not been expressed yet as a commercial recombinant protein. The firdt 
attempt has been made recently. The IFN-a5 gene was cloned and the sequence of the mature^ 
protein was inserted into the pET28 vector, that included the ATG, histidine tag for purification 
and thrombin cleavage sequences. The tagged IFN-a5 was purified first by binding to a nickel 
column and biotinylated thrombin was then used to eliminate the tag on IFN-a5. Biotinylated 
thrombin was removed from the preparation using streptavidin agarose. The expression level was 
5.6 micrograms per liter of broth culture and the recombinant protein was active in antiviral 
activity similar or higher than commercial IFN-a2 (Intron A, Schering Plouth). 

(Update "Human Therapeutic Proteins") As proposed, we have cloned human IFNa5, 
fused with a Histidine tag and introduced the gene into the chloroplast transformatio n vector 
(pLDV Western blots demonstrated expression of the IFNa5 protein in E. coli using pLQ 
vectors, and the maximum level was observed with the 5'psbA UTR and promoter. IFNctS gene v 
was cloned into the pLD using both sequences and bombarded into tobacco leaves. Shoots 
appeared after 5 weeks and the second round of selection is in progress.. 

Iysulin-like Growth Factor-I (IGF-I) 

Recent studies have demonstrated that treatment with low doses of TGF-T induced 
significant improvements in nutritional status (52), intestinal absorption (53-55), osteopenia (56), 
hypogonadism (57) and liver function (58) in rats with experimental liver cirrhosis. These data 
s upport that IGF-I deficiency plays a pathogenic role in several systemic complications 
occurring in liver cirrhosis. Clinical studies are in progress to ascertain the role of IGF-I in the i 
management of cirrhotic patients. Unpublished studies indicate that IGF-I could also be used in 
patients with articular degenerative disease (osteoarthritis). 

(Update "Human Therapeutic Proteins") From previous studies we observed that IGF-I 
gene coding sequence is not suitable for high levels of expression in chloroplasts. Therefore, we 
have determined the optimal chloroplast sequence and employed a recursive PCR method for 
total gene synthesis (see F igure 42). The newly synthesized gene was cloned into a PCR 2.1 
vector. Insertion of zz-tev sequence upstream of IGF1 coding sequence for facilitating 
subsequent purification is in progress. 

To demonstrate expression, purification and proper cleavage of the fusion protein we also 
cloned the full length IGF-I (including the pre-sequence) in an alphavirus vector and expressed 
th e protein in human cultured cells. Alphavirus system has been used because it expresses 
a dequate amounts of protein to induce a very good immune response in test animals. We 
observed that the protein had the predicted size, is properly cleaved in cells to produce the 
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mature protein and is exported into the growth medium. This secreted protein could be 
Jmmunoprecipitated using anti-IGF-I antibody. The zz-tev-IGF-I was also cloned in an , 
, alphavirus vector, expressed and labeled in human cultured cells. This has allowed us to see tha{ 
tte protein had the predicted size and as expected, is not secreted. To cleave 77 tag a fW 
p urification from chloroplasts, TEV protease is necessary. Therefore, we have expressed an^ 
purified TEV protease in bacteria. After purification we could obtain approximately 0.5 me. This 
T EV protease cleaved the labeled zz-tev-IGF-I producing two fragments, zz-tev and mature IGF- 



I. We are currently l abeling more fusion protein to optimize conditions for TEV cleavage . 



Experimental 
ffxample 1 

^valuation of chloroplast gene expression 

A .systematic, approach is used to i dentify and overcome potential limitations of foreign 

gene expression in chloroplasts of transgenic plants. This experiment increases the utility of 

chloroplast transformation system by scientists interested in expressing other foreign proteins. 

Therefore, it is important to systematically analyze transcription, RNA abundance, RNA 

s tability, rate of protein synthesis and degradatio n, proper folding and biological activity. The 
*- — . \ 

rate of transcription of the introduced HSA gene is compared with the highly expressing 

endogenous chloroplast genes (rbcL, psbA, 16S rRNA) , using run on transcription assays to" 

determine if the 16SrRNA promoter is opera ting as expected. The transcription efficiency of ' 

t ransgenic chloroplast containing each of the three cons tructs with different 5' regions is tested/ 

Similarly, transgene RNA levels are m onitored by northerns, dot blots and primer extension, 

relative to endogenous rbcL, 16S rRNA or psbA. These results, al ong with run on transcription 

■ ~ — — — ■ — f 

assays provide valuable information of RNA stability, processing, etc. RNA appears to be 

extremely stable ba sed on northern blot analysis. This systematic study is valuable to advance 

Utility nf this system hv other scientists. Most imnortantl v . the efficiency of translation is tested 

in isolated chloroplasts and co mpared with the highly translated chloroplast protein (vsbA ). 

Pulse ch ase experiments help assess if trans lational pausing, premature termination occurs. 

Evaluation of perce nt RNA loaded on polysomes or in constructs with or without 5'UTRs helps " 

le the effi ciency of the ribosome bindin g site and 5 f stem-loop translational enhancers. 

Codon opt imized genes (IGF-I, IFN) are comp ared with unmodified genes to investigate the rate 

of translat ion, pausing and termination. A 200-fold difference in accumulation ot" foreign 

proteins due to decreases in proteolysis conferred by a putative chaperonin 
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Therefore, proteins from constructs expressing or not expressing the putative chaperonin (with or 
without ORP1+2) provide valua ble information on protein stability. 

Example 2 

Expression of the mature-protein 

HSA, Interferon and IGF-I are pre-proteins that need to be cleaved to secrete mature 
proteins. The codon for translation initiation is in the presequence. In chloroplasts, th e necessity 
of expre ssing the mature protein forces introduction o f this additional amino acid in coding 
sequences. In order to optimize expression levels, we first subclone the sequence of the mature 
proteins beginning with an ATG. Subsequent immunological assays in mice demonstrates th e 
extra-methionine causes immunogenic response and low bioactivitv. Alternatively- systems may 
also produce the mature protein. These systems can include the synthesis of a protein fused to a 
peptide that is cleaved intracellular (processed) by chloroplast enzymes or the use of chemica l 
or enzymatic cleavage after partial purification of proteins from plant cells. 

Use of peptides that are cleaved in chloroplast 

Staub et al. (9) reported chloroplast expression of human somatotropin similar to th e 
native human protein by using ubiquitin fusions that were cleaved in the stroma by an ubiquitin 
protease. However, the proces sing efficiency ranged from 30-80% and the cleavage site was not 
accurate. In order to process chloroplast expressed proteins a peptide which is cleaved in the 
stroma is essential. The transit peptide sequence of the RuBisCo (ribulose 1,5-bisphosphate 
carboxylase) small subunit is an ideal choice. This tran sit peptide has been studied in depth 
ail). RuBisCo is one of the proteins that is synthesized in cytoplasm and tmn^pnrte.H 
postransl ationallv into the chloroplast in an energy dependent process. The transit peptide is^ 
proteolvticallv removed upon transport in the stroma bv the stromal processing peptidase (112>> ^ 
There are several sequences described for different species (113). A transit peptide c onsensus 
sequence for the RuBisCo small subunit of vascular plants is publish ed by Keegstra et al. (1 14). 
T he amino acids that are proximal to the C-terminal (41-59) are highly conserv ed in the higher 
, plant transit sequenc es and belong to the domain which is involved in enzymatic cleavage 
The RuBisCo small subunit transit nentide has been fused with v arious marker proteins 
(1 14.115V even with animal p roteins (116.117). to target proteins to the chloroplast. Prior to 
transformation Studies, the cleavage efficiency and accuracy are tested bv in vitro translation of 
t hfi fusion nrotfiins and in aryane.llo import studi e s usinp intact chloroplasts. Thereafter, 
i knowing the correct fusion sequence for producing the mature p r otein, such sequence encoding 
the amino terminal portion of tnhar.r.n r.hlnrnplast transit peptid e \<z linked with thp. mature 
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^pgiiftn™ nf mrh prntein, Codon composition of th e tobacco RuBisCo small subunit transit , 
ppptiHp ic compatible with chlor oplast optimal translation (see section d3 and table 1 on page 
30). Additional transit peptide sequences for targeting and cleavage in the chloroplast have been 
described fill). The lumen of thylakoids could also be a good target because thvlakoids are 
readily purified. Lumenal proteins can be freed either bv so nication or with a very low triton 
X100 concentration, although this requires insertion of additional amino acid sequences for 
efficient import (1 1 1). 

Example 3 

Use of chemical or enzymatic cleavage 

The strategy of fusing a protein to a tag with affinity for a certain ligand has been us ed 
extensively for more than a decade to enable affinity purification of recombinant products (1 18- 
1 201 A vast number of cleavage methods, both chemical and enzymatic, have been investigate d 
for this purpose (120). Chemical cleavage methods have low specificity and the relatively harsh , 
cleavage conditions can result in chemical modifications of the released products (120). Some of 
the enzymatic methods offer significantly higher cleavage specificities together with high 
pfflriRnr.y e g suhtilisin. IgA protease and factor Xa (1 19,120), but these enzymes hav e 

the drawback of being quite expensive. 

Trypan which clea ves C-terminal of basic amino-acid residues, has been used for a long ^ 
time to cleave fusion proteins (1 4. 12 IV Despite expected low specificity, trypsin has been shown 
to be useful for specific cleavage of fusion proteins, leaving basic residues within folded protein 
domains uncleavaged (121). The use of trypsin only requires that the N-terminus of the mature 
protein be accessible to the protease and that the potential internal sites are protected in the T 
r^t; Yf conformatio n T r >T g1 'r > oMit'*™^ advantag e of being inexpensive and readily 

available. In the case of HSA, when it was expressed in E. coli with 6 additional cbdons coding 
. for a trypsin cleavage site. HSA was processed successfully into the mature protein after 
treatment with the protease. In addition, the N-terminal sequence was found to be unique and 
identical to the sequence of natural HSA, the conversion was complete and no degradation 
products were observed (14). This in vitro maturation is selective because correctly fo lded 
albumin is highly resistant to trypsin cleavage at inner sites (14). This system could be tested for 
rhlnroplnqfc T-f^s A yp.r.tnrs using pr otein expressed in E. coli, 

Stanh e.t al. (9} demonstrated that the chloroplast methionine aminooeptidase is active and 
thev found 95% of removal of the first methionine of an ATG-somatotropin protein that was 
expressed via the chloroplast genome. There are several investigations that have shown a very 
strict pattern of cleavage by this peptidase (122). Methionine is only removed when second 
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residues are glycine, alanine, serine, cysteine, threonine, proline or valine, but if the third amino 
acid is proline the cleavage is inhibited. In the expression of our three proteins we use this 
approach to obtain the mature protein in the case of Interferon because the penultimate 
aminoacid is cvstein followed by aspartic acid. For HSA the second aminoacid is aspartic acid 
and for IGF-I glycine but it is followed by proline, so the cleavage is not dependable. 

For IGF-I expression, the use of the TEV protease (Gibco cat n 10127-017) would be 
ideal. The cleavage site that is recognized for this protease is Glu-Asn-Leu-Tyr-Phe-Gln-Glv 
(SEP ID NO: 10) and it cuts between Gln-Glv. This strategy allows the release of the mature 
protein by incubation with TEV protease leaving a glycine as the first amino acid consistent 
with human mature IGF-I protein. 

The purification system of the E. coli Interferon-a5 expression method was based on 6 
Histidine-tags that bind to a nickel column and biotinvlated thrombin to eliminate the tag (SEP 
ID NO: 12) on IFN-a5. Thrombin recognizes Leu-Val-Pro-Arg-Glv-Ser (SEP ID NO: 11) and 
cuts between Arg and Glv. This leaves two extra amino acids in the mature protein, but 
antiviral activity studies have shown that this protein is at least as active as commercial IFN-a2. 

Example 4 

Optimization of gene expression 

Foreign genes are expressed between 3% (crv2Aa2) and 47% (crv2Aa2 operon) in 
transgenic chloroplasts (3,85). Based on the outcome of the evaluation of HSA chloroplast 
transgenic plants, several approaches can be used to enhance translation of the recombinant 
proteins. In chloroplasts, transcriptional regulation of gene expression is less important, 
although some modulations by light and developmental conditions are observed (123). RNA 
stability appears to be one among the least problems because of observation of excessive 
accumulation of foreign transcripts, at times 16,966-fold higher than the highly expressing 
nuclear transgenic plants (124). Chloroplast gene expression is regulated to a large extent at the 
post-transcriptional level. For example, 5' UTRs are necessary for optimal translation of 
chloroplast mRNAs. Shine-Dalgarno (GGAGG) sequences, as well as a stem-loop structure 
located 5' adjacent to the SD sequence, are required for efficient translation. A recent study has 
shown that insertion of the psbA 5* UTR downstream of the 16S rRNA promoter enhanced 
translation of a foreign gene (GUS) hundred- fold (125a). Therefore, the 200-bp tobacco 
chloroplast DNA fragment (1680-1480) containing 5' psbA UTR should be used. This PCR 
product is inserted downstream of the 16S rRNA promoter to enhance translation of the 
recombinant proteins. 

Yet another approach for enhancement of translation is to optimize codon compositions. 
Since all the three proteins are translated in E. coli (see section b), it would be reasonable to 
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^expect efficient expression in chloroplasts. However, optimizing codon compositions to matc h 

the psbA gene could further enhance the level of translation. Although rbcL (RuBisCO) is th e 

most abundant protein on earth, it is not translated as highly as the psbA gene due to the f 

gxtremely high turnover of the psbA gene product. The nsbA gene is under stronger caption . 

for increased translation efficiency and is the most abundant thylakoid protein. In addition, the. 

codon usage in higher plant chloroplasts is biased towards the NNC codon of 9-fold de g enPr a tP . 

groups (i.e. TTC over TTT, GAC over GAT, CAC over CAT. AAC over AAT. ATC over ATT 

ATA etc.). This is in addition to a strong bias towards T at third position nf A-fnM d^ngrate - 

groups. There is also a context effect that should be taken into consideration while modifying 

Specific codons. The 2-fold degenerate sites immediately upstream from a GNN codon do not 

show this bias towards NNC. (TTT GGA is preferred to TTC GGA while TTC CGT is preferred 

tP TTT CQT, TTC AfiT tft TTT AfiT and TTC TCT fn TTT TfT)(1?^h ; l?fi) in addition,* 

highly expressed chloroplast genes use GNN more frequently that other genes. Codon , 

composition was optimized by comparing different species. Abundance of amino acids in ^ 

chloroplasts and tRNA anticodons present in ch loroplast must be taken into consideration. We : 

" — — — ^ — » 

also compared A+T% content of all foreign genes that had been expressed in transgenic 
chloroplasts in our laboratory with the percentage of chloroplast expression. We found thaf 
hiphflr Iftvr.ls of A+T always correlated with hiyh expression levels ( see tahle 11 Tt is also. 
possible to modify chloroplast protease recognition sites while modifying codons, without 
affecting their biological function s. 

The study of the sequences of HSA. IGF-I and Interferon-^ was Hone The hsa 

Sequence Showed 57% of A+T content and 40% of the total mHnnc mntrhed w ith thp pchA mq cf 

translated codons. According to the data of tahle 1. we expected good rhloroplast pyprpccinn of 
Jhe HSA Rene without anv modifications in its codon composition TTTM.ns ft-uy 
content and 40% of matching with psbA codons. The composition seems to he good hut this 
protein is small (166 amino acidsl and the sequence was optimized to achieve A+T levels close 
tQ 65%. Finally, the analysis of the IGF-I sequence showed t hat the A+T content was 40% and . 
only 20% of the codons are th e most translated in psbA. Therefore, this gene needed to be , 
o ptimized. Optimization of these two genes is done using a novel PCR approac h (127,128) 
which has been successfully used to optimize codon composition of o ther human proteins. 

Example 5 

Vector constructions 



For all the constructs pLD vector is used. This vector was developed in this laboratory for 
c hloroplast transformation. It contains the 16S rRNA promoter (Prrn) driving the selectable 
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marker gene aadA (aminoglycoside adenyl trans ferase conferring resistance to spectinomycin) 
followed by the psbA 3' region (the terminator from a gene coding for photosystem II reaction 
center components) from the tobacco chloroplast genome. The pLD vector W a universal r 
chloroplast expression /integration vector and can be used to transform r.hlo rr>pic»ct ^ n om°g of 
several other plant species (73,86) because these flanking sequences are highly conserved among 
higher plants. The universal vector uses trnA and trnl genes (chloroplast transfer RNAs coding 
for Alanine and Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as ^ 
flanking sequences for homologous recombination. Because the universal vector integrates 
foreign genes within the Inverted Repeat region of the chloroplast genome, it should double the 
copy number of the transgene (from 5000 to 10,000 copies per cell in tohaccoV Fnrthprmorp it 
has been demonstrated that homoplasmv is achieved even in the first round of gf»lfwj^ n j n 
tobacco probably because of the presence of a chloroplast origin of replication within the 
flanking sequence in the universal vector (thereby providing more templates for integration). 
Because of these and several other reasons, foreign gene expression w as shown to be much 
higher when the universal vector was used instead of the tobacco specific vector (88). 

The following vectors are used to optimize protein expression, purification and production of 
proteins with the same amino acid composition as in human proteins . 

a) In order to optimize expression, translation is increased using the psb^^UTR and 
optimizing the codon composition for protein expression in chloroplasts according to criteria^ 
discussed previously. The 200 bp tobacco chloroplast DNA fragment containing 5* psbA 
UTR is amplified by PCR using tobacco chloroplast DNA as template. This fragment is 
cloned directly in the pLD vector multiple cloning site (EcoRI-Ncol) downstream of the 
promoter and the aadA gene. The cloned sequence is exactly the same as in the psbA gene. 

b) For enhancing protein stability and facilitating purification, the crv2Aa2 Bacillus w 
t huringiensis operon derived putative chaperonin is used. Expression of the crv2Aa2 operon 
in chloroplasts provides a model system for hyper-expression of foreign proteins f46% of 
total soluble nrotein) in a folded configuration enhancing their st ability anH facilitating 
purification (3). This justifies inclusion of the putative chaperonin from the crv2Aa2 operon 
in one of the newly designed constructs. In th is region there are two open retina f^m^c 

■ fPRFl and ORF7,) and a ribosomal binding site frhs). This sequence contains element 
necessary for Crv2Aa2 crystallization which heln to crystallize the HSA IGF -f and TFN-a . 
proteins aiding in the subsequent purification. Successful crystallization of other protein s 
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using this putative chaperonin has been demonstrated (94 ). We amplify the ORF1 and ORF2 
of the Bt Cry2Aa2 operon by PCR using the compiete.operon as template. The fragment is 
cloned into a PCR 2.1 vector and excised as an EcoRJ-EcoRV product. This fragment is then 
cloned directly into the pLD vector multiple cloning site (EcoRI-EcoRV) downstream of the . 
promoter and the aadA gene. 

c) To obtain proteins with the same amino acid composition as mature human proteins, we first 
fuse all three genes (codon optimized and native sequence) with the RuBisCo small subunit 
transit peptide. Also other constructions are done to allow cleavage of t he protein after 
isolation from chloroplast. These strategies also allow affinity purification of the proteins. 

The first set of constructs includes the sequence of each protein beginning with an ATG, . 
introduced by PCR using primers. Processing to get the mature protein may be performed wherq 
the ATG is shown to be a problem (determined by mice immunolo gical assays^ First we use 
the RuBisCo small subunit transit peptide. This transit peptide is amplified by PCR using 
;obacco DNA as template and cloned into the PCR 2.1 vector. All genes are fused with the 
transit peptide using a Mlul restriction site that is introduced in the P CR primers for 
amplification of the transit peptide and genes coding for three proteins. The gene fusions are 
inserted into the pLD vectors downstream of the 5'UTR or ORE 1+2 using th e restriction sites 
Ncol and EcoRV respectively. If use of tags or protease sequences i s necessary such sequences 
can be introduced by designing primers including these sequences and amplifying the gene with 
PCR. After completing vector constructions^!! the vectors are sequenced to confirm correct 
nucleotide sequence and in frame fusion. DNA sequencing is done u sing a Perkin Elmer AB1 
prism 373 DNA sequencing system. 

TW.anQe. nf the simUarity7of protein synthetic machinery (1091 expression of all 

chloroplast vectors is first tested in Kcoli before their use in tobacco transform ation. For 

jr*rhrrrr>hin mli expression XL- 1 Blue strain is used. E. coli can be transformed by standard 
CaCb transformation procedures and grown in TB culture media. Purification, biological and 
immunogenic assays are done using E . coli expressed proteins. 

Example 6 

Bombardment, Regeneration and Characterization of Chloroplast Transgenic Plants 

Tobacco (Nicotiana tabacum var. Petit Havana) plants are grown aseptically by 
germination of seeds on MSO medium. This medium contains MS salts (4.3 g/liter), B5 vitamin 
mixture fmvo-inositoL 100 mg/liter; thiamine-HCl, 10 mg/liter; nicotinic acid. 1 mg/liter; 
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pyridoxine-HCl, 1 mg/liter), sucrose (30 g/liter) and phytagar (6 g/liter) at pH 5.8. Fully 
expanded, dark green leaves of about two month old plants are used for bombardment- 
Leaves are placed abaxial side up on a Whatman No. 1 filter p aper laying on the RMOP 
medium (79) in standard petri plates (100x15 mm) for bombardment. Gold (0.6 ^m) 
microproiectiles are coated with plasmid DNA (chloroplas t vectors) and bombardments are 
carried out with the biolistic device PDSIOOO/He (Bio-Rad ) as described by Daniell (110). 
Following bombardment, petri plates are sealed with pa rafilm and incubated at 24°C under 12 h > 
photoperiod. Two days after bombardment, leaves are chopped into small pieces of ~5 mm 2 in. 
size and placed on the selection medium (RMOP cont aining 500 ng/ml of spectinomycin 
dihydrochloride) with abaxial side touching the medium in deep (100x25 mm) pe tri plates (-10 
.pieces per plate). The regenerated spectinomycin resistant shoots are chopped into small pieces 
(~2mm 2 ) and subcloned into fresh deep petri plates (~5 pieces per plate) containing the same 
selection medium. Resistan t shoots from the second culture cycle are then transferred to the 
rooting medium fMSO medium supplem ented with IBA. 1 me/liter and spectinomycin , 
dihydrochloride, 500 mg/liter). Rooted plants are transferred to soil and grown at 26°C under 16 
hour photoperiod conditions for further analysis. 



PCR analysis of putative transformants 

PCR is done using DNA isolated from control and transgenic, nlants in orrifir tn . 
di stinguish a) true chloroplast transformants from mutan ts and b) chloroplast tr 
nuclear transformants. Primers for testing the pres e nce of the aadA gene (that confers . 
spectinomycin resistance) in transgenic p lants are landed on the aadA mriinp seniirnrr anri IfiS 
rRNA gene. Tn order to test chlorop l ast integration nf the genes one nrimPT lands, on thfi aadA 
sene while another lands on the native chloroplast genome. 1 No PCR product is obtained with 
nuclear transgenic plants using this set of primers, the primer set is used to test integration of the 
entire gene cassette without any internal deletion or looping out during homologous t 
r ecombination. Similar strategy was used successfully to confirm chloroplast integration o f 
foreign genes (3, 85-88). This screening is essential to eliminate mutants and nuclear 
t ransformants. In order to conduct PCR analyses in transgenic plants, t otal DNA from. 
unbombarded and transgenic plants is isolated as described by Edwards et al. (129). Chloroplas t 
transgenic plants containing the desired gene are then moved to second round of selection in 
order to achieve homoplasmy. 



Southern Analysis for homoplas my and copy number 

Southern blots are done to determine the copy number of the introduced foreign gene per 
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cell as well as to test homoplasmy. There are several thousand copies of the chloroplast genom e 
present in each plant cell. Therefore, when foreign genes are inserted into the chloroplast^ 
genome, some of the chloroplast genomes have forei gn genes integrated while others remain as. 
t he wild tvne (heteroplasmy). Therefore, in order to ensure that only the transformed geno me 
exists in cells of transgenic plants (homoplasmy), the selection process is continued. In order to 
confirm that the wild type genome does not exist at the end of the selection cycle, total DNA 
from transgenic plants are probed with the chloroplast border (flanking) sequences (the trnl-trnA ^ 
fragment). When wild type genomes are present (heteroplasmy), the native fragment size is 
observed along with transformed genomes. Presence of a large fragment (due to insertion of 
foreign genes within the flanking sequences) and absence of the native small fragment confirms | 

homoplasmy (85,86,88). 

The copy number of the integrated gen e is determined by establishing homoplasmy for, 
the transgenic chloroplast genome. Tobacco chloroplast s contain 5000-10,000 copies of their 
f ennme ne.r ceH f*<* Tf nnlv a W.tinn of the genomes are actually transformed, the copy 
number, bv default, must be less than 10.000. By e stablishing that in the transgenics the gene , 
inserted transformed genome is the only one present, one can establish that the copy numb er^ 
.5000-1 0 000 ner cell. This is usually done by digesting the total DNA with a suitable restric tion 
enzyme and probing with the flanking sequences that enable hom ologous recombination into the 
chloroplast genome. The native fragment present in the control should be absent in the 
transgenics. The absence of native fragment proves that only the transgenic chloroplast genome 
is present in the cell and there is no native, untransformed, chloroplast genome, without the 
foreign gene present. This establishes the homoplasmic nature of ou r transformants, 
simultaneously providing ,.s with «m estimate of 5000-1 0.000 copies of the foreign genes per , 
. cell. 

Northern Analysis for transcript stability 

Northern blots are done to test the efficiency of transcription of the genes. Total RNA is. 
i solated from 150 mg of frozen leaves bv using the "Rneasv Plant Total RNA Isolation Kit" 
(Qiagen Inc., Chatsworth, CAl RNA (10-40 tig) is denatured by formaldehyde treatment , 
separated on a 1.2% agarose gel in th e presence of formaldehyde and transferred ft a , 
nitrocellulose membrane (MSn as described i n Sambrook et al. (130). Probe DNA (proinsulin . 
gene coding region) is labeled h Y the rando ^ -pHir"* method fPmmr^ with 32 P rlPTP i-ntnpn - 
The hint is p~.hvhriHiTe.ri. n vhridizeri and washed as described above for southern blot analysis., 
Transcript levels are quantified by the Molecular Analy st Program using the GS-700 Imaging 
Densitometer (Bio-Rad, Hercules, CA). 
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Expression and quantification of the total protein expressed in chloroplast 

Chloroplast expression assays are done for each protein by Western Blot. Recombinant 
protein levels in transgenic plants are determined using quantitative ELISA assays. A standard 
curve is generated using known concentrations and serial dilutions of recombinant and native 
, proteins. Different tissues are analyzed using young, mature and old leaves against these primary 
antibodies: goat anti-HSA (Nordic Immunology), anti-IGF-I and anti-Interferon alpha (Sigma). 
Bound IgG is measured using horseradish peroxidase-labelled anti-goat IgG. 



Inheritan ce of Introduced Foreign Genes 

While it is unlikely that introduced DNA would move from the chloroplast genome to. 
nuclear genome, it is possible that the gene coul d get integrated in the nuclear genome during , 
bombardment and remain undetected in S o uthern analysis. Therefore, in initial tobacco . 
transformants. some are allowed to self-poll inate, whereas others are used in FciprocaJ crgssgs , 
wi th control tobacco (transgenics as female accepters and pollen donors; testing for maternal ^ 
inheritance). Harvested seeds (Tl) will be germinated on media containing spectinomycin. 
.Achievement of homoplasmy and mode of inheritance can be classified by looking at 
germination results. Homoplasmy is indicated by totally green seedlings (86) while heteroplastic 
is displayed by variegated leaves (lack of pigmentation, 83). Lack of variation in chlorophyll 
pigmentation among progeny also underscores the absence of position effect, a n artifact of 
nuclear transformation. Maternal inheritance is be demonstrated by sole transmission of 
introduced genes via seed generated on transgenic plants, regardless of pollen source (green 
seedlings on se lective media). When transgenic pollen is used for pollination of control plants 
resultant prog eny do not contain resist ance to chemical in selective media (will appear pleacnecf; 
83). Molecular analyses confirm transmission and expression of introduced genes, and vi seedis 
generated from those confirmed plants by the analyses describe d above. . 

Example 7 

Purification methods 

T he standard method of purification employs classical biochemical techniques with the 
. crystallized proteins inside the chloroplast. In this case, the homogenates are passed through 
miracloth to remove cell debris. Centrifugation at 10,000 xg pelletizes all foreign proteins (3) . 
Proteins are solubilized using pH, temperature gradient, etc. This is possible if the ORF1 and 2 
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of the cry2Aa2 operon (see section c) can fold and crystallize the recombinant proteins as 

expected. Were there is no crystal formation, other purification methods must be used (classica l 

biochemistry techniques and affinity columns with protease cleavage). 
— r 

HSA: Albumin is typi cally administered in tens of gram quantities. At a purity level of 99.999% 
(a level considered suffici ent for other recombinant protein preparations), reco mbinant HSA 
(rHSA) impurities on the order of one m g will still be injected into patients. So impurities from 
the host organism must be reduced to a minimum. Furthermore, purified rHSA must be identical 
to human HSA. Despite these stringent requirements, purif ication costs must be kept low. To . 

, purify the HSA obtained bv gene manipulation , it is not appropriate to'annlv the conventional — 
processes for purifying HSA originating in plasma as such. This is because the impurities to. he , 
eliminated from rHSA completely differ from those contained in the HSA originating in plasma. 
Namely, rHSA is contaminate d with, for example, coloring matters characteristic to recombinant 
HSA, proteins originating in the host cells, polysaccharides, etc. In particular, it is necessary to 
sufficiently eliminate components originating in the host cells, since tney are roreign mandirftft 

x living organisms including human and can cause the problem of antigenicity. ^ 

In plants two different methods of HSA purification have been done at laboratory scale. > 
S ijmons et al. (23) transformed potato and tobacco plants wit h Asrobacterium tumefaciens. For, 
the extractio n and purification of HSA, 1000 g of stem and leaf tissue was homogenized in 1000 
ml cold PBS, 0.6% PVP, 0.1 mM PMSF and 1 mM EDTA. The homogenate was clarified by 
filtration, centrifuged and the supernatant incubated for 4 h with 1.5 ml polyclonal antiHSA 
c oupled to Reactigel spheres (Pierce Chem) in the presence of 0.5% Tween 80. The complex 
HSA-anti HSA-Reactigel was collected and wash ed with 5 ml 0.5% Tween 80 in PBS. HSA was 
desorbed from the reactigel complex with 2.5 ml of 0.1 M glycine pH 2.5, 10% dioxa"n^ 
immediately followed by a buffer exchange with Sephadex G2 5 to 50 mM Tris pH 8. Trie 
sample was then lo aded on a HR5/5 MonoQ anion exchange column (Pharmacia) and eluted 
with a linear NaCl gradient (0-350 mM NaCl in 50 mM Tris pH 8 in 20 min at lml/min). v 
Fractions containing the concentrated HSA (at 290 mM NaCl) were lyophilized and applied to a 
HR 10/3 0 Sepharose 6 column (Pharmacia) in PBS at 0.3 ml/min. However, this method uses . 
affinity columns (polyclonal ant i-HSA) that are very expensive to scale-up. Also the protein is 
r eleased from the column with 0.1M glycine pH 2.5 that will most probably, denature the 
protein. Therefore, this method can suitably modified to reduce these drawbacks. 
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The second method is for HSA extraction and p i .rifWinn from notafn tuhrrs ff)r 
Mineo-Castel's laboratory). After grinding the t u ber in nhosnhate buffer r>H 7,4 (1 mp/?ml) tnP , 
homogenate is fibred in rnin r ^h .nH r.entrifimed at 14 (KM mm H minute After thir FtPip— 
another filtration of the supernatant in 0.45 urn fi tters is necessary. Then, chromatography of 
ionic exchange in FPLC using a DEAE Sepharose Fast Flow column (Amersham) is required- 
Fractions recovered are passed through an affinity column (Blue Sepharose fast flow Amersham) 
llting in a product of high purity. HSA purification based on either method is acceptable. 



resul 



IGF-1: All earl ier attempts to produce IGF-I in E. coli or Saccharomyces cerevisiae have 
resulted in misfolded p roteins. This has made it necessary to perform additional in vitro refolding 
or extensive separation techniques in order -to recover the native and biologi cal form of the 
m olecule. In addition, IGF-I has been demonstrated to possess an intrinsic thermodynamic 
folding problem with regard to quantitatively folding into a native dis ulfide-bonded 
conformation in vitro (131). Samuelsson et al. (131) and Jo lv et al. (132) co-expressed IGF-I , 
with specific proteins of E. coli that significantly improved the relativ e yields of correctly folded 
prot ein and consequently facilitating purification. Samuelsson et al. (132) fused the protein to 
affinity tags based on either the IgG-binding domain (Z) from Staphylococcal protein A or the 
two serum albumin domains (ABP) from Streptococcal protein G (134). The fusion protein 
concept allows the IGF-I molecules to be purified by IgG or HSA affinity chrom atography. We 
also use this Z tags for protein purification including the double Z domain from S. aureus protein 
and a sequence recognized by TEV protease (see section d.2). The fusion protein is incubated 
with an IgG column where binding via the Z domain occurs. Z domain-IgG interaction is very 
specific and has high affinity, so contaminant proteins can be easily wash ed off the column- 
Incub ation of the column with TEY protease elutes mature IGF-I from the column. TEV protease 
is produced in ba cteria in large quantities fused to a 6 histi dine tag that is used for TEV 
purification. This tag can be also used to separate IGF-I from contaminant TEV protease . 

IFN-a: In the E. coli expression method used, the purific ation system was based on using 6 
Hi stidine-tags that bind to a nickel column and biotinylated thrombin to eliminate the tag on 
IFN-a5. 
Example 8 

Characterization of the recombinant proteins 

For the safe use of recombinant proteins as a replacement in any of the current 
applications, these proteins must be structurally equivalent and must not contain abnormal host- 
derived modifications. To confirm compliance with these criteria we compare human and 
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Mass analysis 



ii^osprav mass spectrometry (ESMS) is per fo rmed using a YG Ouattro electrospra^ 
mass spectrometer. Samples are desalted prior to analysis by reverse-ph ase HPLC using an 
acetonitrile gradient containing trifluoroacetic acid. 



CD 
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Spectra are measured in a nitrogen atmosphere using a Jasco J600 spe ctropolarirheter. 

Chromatographic te chniques 

For HSA, analytical gel-permeation HPLC is performed using * TSK G3000 SWxl 
column. Preparative gel permeation chromatography of HS A is performed usinp a Sephacryl 
S200 HR column. The monomer fraction, identified by absorbance at 280 nm, is dialyzed and 
reconcentrated to its starting concentration. For IGF-I, the reversed-phase chromatography the 
SMART system (Pharmacia Biotech) is used with the mRPC C2/18 SC 2.1/10 column. 



Viscosity 



This is a classical assay for recombinant HSA. Viscosity is a characteristic of proteins t 
related directly to their size, shape, and conformation. The viscosities of HSA and recombinant 
HSA can be measured at 100 mg. Ml-1 in 0.15 M NaCl using a U-tube viscometer (M2 type, 
Poulton, Selfe and Lee Ltd, Essex, UK) at 25°C. 



Glycosylation 

Chloroplast proteins are not known to be glycosylated. Howeve r there are no publications 
t o confirm or refute this assumption. Therefore glycosylation should be measur ed using a scaled- 
up version of the method of Ahmed and Furth (136). 



Example 9 
Biological Assays 

Since HSA does not have enzymatic activity, it is not possible to run biological assays. 
However, three different techniques can be used to check IGF-I fun ctionality. All of them are , 
based on the proliferation of IGF-I responding cells. First, radioact ive thymidine uptake can be_ 
measured in 3T3 fibroblasts, that express IGF-I receptor, as an estimate of DN A synthesis. Also, 
a human megakaryoblastic cell line, HU-3, can be used. As HU-3 prows in suspension, changes 
in cell number and stimulation of glucose uptake induced b y IGF-I are assayed using 
AlamarBlue or glucose consumption, respectively. AlamarBlue (Accumed International, 
Westlake.QH) is reduced by mitochondrial enzyme activity. The reduced form of the reagent is 
fluorescent and can be quantitatively detected, with an excitation of 530 nm and an emission o( 
590 nm. AlamarBlue is added to the cells for 24 hours after 2 days inducti on with different doses, 
of IGF-I and in the absence of serum. Glucose consumption by HU-3 cells is then measured 
using a colorimetric glucose oxidase procedure provided by Sigma. HU-3 cells are incubated in 
'the absence of serum with different doses of IGF-I. Glucose is added for 8 hours and glucose 
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concentratio n is then measured in the supernatant. All three methods to measure IGF-I 
functionality are p recise, accurate and dose dependent, with a linear range between 0.5 and 50 . 
ng/ml (137). 



The method to determine IFN activity is based on their anti-viral properties. This 



procedure measures the ability of IFN to protect HeLa cells against the cytopa thic effect of 
encephalomyocarditis virus (EMC). The assay is performed in 96-well microtitre plate. First, 
HeLa cells are seeded in the wells and allo wed to grow to confluency. Then, the medium is 
removed, replaced with me dium containing IFN dilutions, and incubated for 24 hours. EMC 
virus is added and 24 hours later the cytopathic effect is measured. For that, the medium is 



removed and wells are rinsed two times with PBS and stained with methylviolet dye solution. 
The optical density is read at 540 nm. The values of optical density are proportional to the 
antiviral activity of IFN (138). Specific activity is determined wit h reference to standard IFN-a 
(code 82/576) obtained from NIB SC. 



Example 10 

Anim al testing and Pre-Clinical Trials ^ 

Once albumin is produced at adequate levels in tobacco and the physicochemical 
properties of the product corresp ond to those of the natural protein, toxicology studies need to be 
done in mic e. To avoid mice response to the human protein, transgenic mice carrying HSA 
ge nomic sequences are used (139). After injection of none, 1, 10, 50 an d 100 mg of purified 
reco mbinant protein, classical toxicology studies are carried out (body weigh and food intake, . 
animal behavior, piloerection, etc). Albumin can be tested for blood volume replacement after 
paracentesis to eliminate the fluid from the peritoneal cavity in patients with liver cirrhosis. It 
has been shown that albumin infusion after this maneuver is esse ntial to preserve effective . 
ci rculatory volume and renal function (140). 

IGF-1 a nd IFN-a are tested for biological effects in vivo in animal m odels. Specifically, , 
woodchucks (marmota monax) infected with the woodchuck hepat itis virus (WHV), are widely 
considered as the best animal model of hepatitis B virus infection (141 1 Preliminary studies have 
shown a significant increase in 5' oligoadenylate synthase RNA leve ls by real time polymerase 
chain reaction (PCR) in woodchuck peripheral blood mononuclear cells upon incubation with 
human IFN 05, a proof of the biological activity of the human I FN-a5 in woodchuck cells. For in 
vivo studies, a total of 7 woodchucks chr onically infected with WHV (WHV surface antigen and 
WHV-DNA positive in serum) are used: 5 animals are injected subcutaneously with 500,000 
units of human IFND5 (the activity of human IFN-a5 is determined as described previously) 
three times a week for 4 months; the remaining two woodchucks are injected with placebo and 
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used as controls. Follow-up includes w eekly serological (WHY surface antigen arid anti-WHV 
surface antibodies by ELISA) and viroiogical (WHV DNA in serum by real time quantitative 
PCR) as well as monthly immunological (T -helper responses against WHV surface and WHV 
core antigens measured by interieukin 2 production from PBMC incubated with those proteins) 
studies. Finally, basal and end of treatment liver biopsies should be performed to score liver 
inflammation and intrahepatic WHV -DNA levels. The final goal of treatment is decrease of viral 
replication by WHV-DNA in serum, with secondary end points being histological improvemen t 
and decrease in intrahepatic WHV-DNA levels. 

For IGF-1, the in vivo therapeutic efficacy is tested in animals in situations of IGF-I 
deficiency such as liver cirrhosis in rats. Several reports (56-58) have been published showing 
that recombinant human IGF-I has marked beneficial effects in increasin g bo ne and muscle 
mass, improving liver function and correcting hypogonadism. Briefly, the induction protocol is 
as follows: Liver cirrhosis is induced in rats by inhalation of c arbon tetrachloride twice a week 
for 11 weeks, with a progressively increasing exposu re time from 1 to 5 minutes per gassing 
session. After the 11 th week, animals continue receiving CC1 4 once a week (3 minutes per 
inhalation) to complete 30 weeks of CC1 4 administration. During the whole induction period, 
phenobarbital (400 mg/L) is added to drinking water. To test the therapeutic efficacy of tobacco- 
derived IGF-I, cirrhotic rats receive 2 n g/100 g body weight/day of this compound in two 
divided doses, during the last 21 days of the induction protoco l (weeks 28, 29, and 30). On day 
22, animals are sacrificed and liver and blood samples collected. The results are compared to 
those obtained in cirrhotic animals receiv ing placebo instead of tobacco-derived IGF-I, and to 
healthy control rats . 

Expression of the Native Cholera Toxin B Subunit Gene as Oligomers 
Bacterial a ntigens like the B subunit proteins, CTB and LTB, which are two chemically, 
structurally and immunologically similar candidate vac cine antigens of prokaryotic enterotoxins, 
have been expressed in plants. CTB is a candidate oral subunit vaccine for cholera that causes 
-acute watery diarrhoea by colonizing the small intestine and pr oducing the enterotoxin, cholera 
toxin (CT). Cholera toxin is a hexameric AB 5 protein consisting of one toxic 27 kDa A subunit_ 
having ADP ribosyl transferase activity and a nontoxic pentamer of 1 1.6 kDa B subunits (CTB) 
that binds to the A subunit and facilitates its entry i nto the intestinal epithelial cells. CTB when 
administered orally is a potent mucosal immunogen, which ca n neutralize the toxicity of the CT 
holotoxin by preventing it from binding to the intestinal cells (4). This is believed to be a result 
of it binding to eukaryotic cell surfaces via G M i gang liosides, receptors present on the intestinal 
epithelial surface, eliciting a mucosal immune response to pathogens and enhancing the immune 
response when chemically coupled to other antigens (5,6). ~ 
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Native CTB and LT B genes have been expressed at low levels via the plant nucleus. 
Since, both CTB and LTB are AT-rich compared to plant nuclear genes, low expression was " 

probably due to a number of factors such as aberrant mRNA splicing, mRNA instability or 

inefficient codon usage. To avoid these undesirable features synthetic "plan t optimized" genes 
encoding LTB were created and expressed in potato, resulting in potato tubers expressing up to 
10 - 20 /^g of LTB per gram fresh weight (7). However, extensive codon modificatio n of genes 
is laborious, expensive and often not available due to patent restrictions. One of the 
consequences of these constitutively expressed high LTB levels, was the stunted growth of 
transgenic plants that was eventually overcome by tissue specific expression in potato tubers. 
The maximum amount of CTB protein detected in auxin induc ed, nuclear transgenic potato leaf 
tissues was approximately 0.3% of the total soluble leaf protein when the native CTB gene was 
fused to an endoplasmic reticulum retention signal, thus targeting the protein to the endoplasmic 
reticulum for accumulation and assembly (8). 

Increased expression levels of several, proteins have been attained by expressing foreign 
proteins in chloropiasts of higher plants (9-1 1). Human somatotropin has been expressed in 
chloroplasts with yields of 7% of the total soluble protein (12). The accumulation levels of the 
Bt Cry2Aa2 operon in tobacco chloroplasts are as high as 46.1 % of the total soluble plant 
protein (1 3). This high level of expression is attributed to the putative chaperoning, orf 1 and 
orf 2, upstream of Cry2Aa2 in the operon that may help to fold the protein into a crystalline form 
that is stable and resistant to proteolytic d egradation. Besides the ability to express polycistrons, 
yet another advantage of chloroplast transformation I, is the lack of recombinant protein 
expression in pollen of chloroplast transgenic plants. As there is no chloroplast DNA in pollen 
of most crops, pollen mediated outcross of recombinant genes into the environment is minimized 
(10-15)- t 

Since the transcriptional and translational machinery of plastids is prokaryotic in origin 
and the N. tabaccum chloroplast genome has 62.2% AT content, it was likely that native CTB 
genes would be efficiently expressed in this organelle without the need for codon modification. 
Also, codon comparison of the CTB gene with psbA, the major translation product of the 
chloroplast, showed 47% homology with the most frequent codons of the psbA gene. Highly 
expressed plastid genes display a codon adaptation, which is defined as a bias towards a set of 
codons which are complimentary to abundant tRNAs (16). Codon analysis showed that 34% of 
the codons of CTB are complimentary to the tRNA population in the chloroplasts in comparison 

with 5 1 % of psbA codons that are complimentary to the chloroplast tRNA population. 

, Also, stable incorporation of the CTB gene into the precise location between the trnA and 
trnl genes of the chloroplast genome by homologous recombination, should eliminate the 
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'position effect' frequently obser ved in nuclear transgenic plants. This should allow uniform 
expression levels in different transgenic lines. Amplification of the 

transgene, should result in a high level of CTB gene expression since each plant cell contains ur 
to 50,000 copies of the plastid genome (17). Another significant advantage of the production of 
CTB in chloroplasts, is the ability of chloroplasts t o form disulfide bridges (12,18,19) which are j 
necessary for the correct folding and assembly of the CTB pentamer (20). 

In this study, we report the integration of the CTB gene into the inverted repeat region of 

the tobacco chloroplast genome, allowing 2 copies / chloroplast genome of the CTB gene per 
cell, resulting in chloroplasts accumulating high levels of CTB. This eliminates the need to 
modify the CTB gene for optimal expression in plant s. 

Construction of the Chloroplast Expression Vector pLD-CTB: The leader sequence (63 bp) 
of the native CTB gene was deleted and a start codon was introduced at the 5' end . Primers were , 
designed to introdu ce an rbs site 5 bases upstream of the start codon. The CTB PCR product was t 
then cloned into the multiple cloning site of the pCR2.1 vector (Invitrogen) and subsequently 

into the chloroplast expression vector pLD-CtV2 using suitable restriction sites. Restriction 

enzyme digesti ons of the pLD-LH- CTB vector were done to confirm the correct orientation of 
the inserted fragment. 

Expression of the pLD-LH- CTB vector was tested in E. coli XL-1 Blue MRFtc strain t 
before tobacco transformation. E. coli was transformed by standard CaCb transformation 
procedures. Transformed E. coli (24 and 48 hrs culture in 100ml TB with 100 jug/ml ampicillin) 
and untransformed-£ co li (24 and 48, hrs culture in 100 ml TB with 12,5 jug/ml tetracycline) 
were centrifuged for 15 min. The pellet obtained was w ashed with 200mM Tris-Cl twice, 
resuspended in 500 fA extraction buffer (200mM Tris-Cl, pH 

8.0, lOOniM NaCl,,10mM:EDT^.2mM PMSF) and sonicated. To aliquots of 100 jjI transformed 
and untransformed sonicates [containing 50- 100 pig of crude protein extract as determined by v 
Bradford protein assay (Bio-rad)] and purified CTB (100 ng, Sigma), 2X SDS sample buffer was 
added. These sample mixtures were loaded on a 15 % sodium SDS -PAGE gel and 
electrophoresed at 200v for 45 min. in Tris-glycine buffer (25mM Tris, 250 mM glycine, pH 8.3, 

£ , . , ■ 

0.1% SDS). The separated protein was transferred to a nitrocellulose membrane by 
electroblotting at 70v for 90 min. 

Immunoblot Analysis of CTB Production in E. coli: Nonspecific antibody reactions were 
blocked by incubation of the membrane in 25 ml of 5% non-fat dry milk in TBS buffer for 2 h on 
a rotary shaker (40 rpm) followed by washing in TBS buffer for 5 min. The membran e was 
incubated for lh in 30 ml of a 1:5000 dil ution of rabbit anti-cholera nnti. ^mni (Sip™ 0 ) *" Tna^T . 
(TBS with 0.05% Tween-20), containing 1% non-fat dry milk, followed by washing thrice in 
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TBST. Incubation for an hour at room temperature in 30 ml of a 1: 10,000 dilution of alkaline 
phoshphatase conjugated mouse anti-rabbit IgG. (Sigma) in TBST. washing thrice in TBST and 
once with TBS was followed by incubation in the Alkaline Phoshphatase Color Development 
Reagents. BCIP/NBT in AP color development buffer (Bio-Rad) for an hour. 
Bombardment and Regeneration of Chloroplast Transgenic Plants: Fully expanded, dark 
green leaves of about two-month old Nicotiana tabacum var. Petit havana plants were placed 
abaxial side up on filter papers in RMOP (2 1) petridish plates. Microproiectiles coated with 
pLD-LH-CTB DNA were bombarded into the leaves using the biolistic device PDSIOOO/He 
(Bio-Rad). as described by Daniell (2 1). Following incubation at 24°C in the dark for two days, 
the bombarded leaves were cut into small (-5mm 2 ) pieces and placed abaxial side up (5 
pieces/plate) on selection medium (RMOP containing 500 mg/L spectinomvcin dihvdrochloride). 
Spectinomvcin resistant shoots obtained after about 1 - 2 months were cut into small pieces 
( -2mm 2 ) and placed on the same selection medium. 

PCR Analysis: Total plant DNA from putative transgenic and untransformed plants was 
isolated using the DNeasv kit (Piagen). PCR primers 3P (5'AAAACCCGTCCTCAGTTCGGAT 
TGC-3') (SEP ID NO: 13) and 3M (S'-CCGCGTTGTTTCATCAAGCCTTACG-S') (SEP ID 
NP: 14) were used for PCR on putative transgenic and untransformed plant total DNA. Samples 
were carried through 30 cycles using the following temperature sequence: 94°C for 1 miru 62°C 
for 1 ,5min and 72°C for 2 min. Cycles were preceded by denaturation for 5 min at 94°C. PCR 
confirmed shoots from the second selection were transferred to rooting medium (MSP medium 
containing 500 mg/L spectinomvcin). 

Southern Blot Analysis: Ten micrograms of total plant DNA (isolated using DNeasv kit) p er 
sample were digested with BgllL separated on a 0.7% agarose gel and transferred to a nylon 
membrane. A 0.8 kb fragment probe, homologous to the chloroplast border sequences, was 
generated when vector DNA was digested with BglH and BamHI. Hybridization was performed 
using the Ready To Go protocol (Pharmacia). Southern blot confirmed plants were transferred to 
pots. Pn flowering, seeds obtained from Tnjines were germinated on spectinomvcin 
dihvdrochloride-MSP media and T^seedlings were grown in bottles containing MSP with 
spectinomvcin (500 mg/L) for 2 weeks. The plants were later transferred to pots. 
Western Blot Analysis of Plant Protein: Transformed and untransformed leaves (100 mg) 
were ground in liquid nitrogen and resuspended in 500 fal of extraction buffer (200mM Tris-CL 
pH8.0, 100 mM NaCK lOmM EDTA. 2 mM PMSF). Leaf extracts (100 - 120 ug as determined 
bv Lowrv assay) were boiled (4 min) and unboiled in reducing sample buffer (BioRad) and 
electrophoresed in 12% polyacrvlamide gels using the buffer system of Laemmli (22). The 
separated proteins were transferred to a nitrocellulose membrane bv electroblotting at 85v for lh. 
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The immunoblot detection procedure was similar to that done for E. coli blots described above. 
For the chemiluminescent detection, the S. Tag™ AP Lumiblot kit (Novagen) was used. 
ELISA Quantification of CTB: Different concentrations (100 jwI/weIl) l o"f 100 mg leaves 
(transformed and untransformed plants) ground with liquid nitrogen and re^uspended in 
bicarbonate buffer, pH 9.6 (15mM Na 2 C0 3 - 35mM NaHC0 3 ) were bound to a 96 well polyvinyl 
chloride microliter plate (Costar) overnight at 4°C. The background was blocked with 1% 
Bovine serum albumin (BSA) in 0.01M phosphate buffered saline (PBS) for 2h at 37°C, washed 
thrice with washing buffer, PBST (PfiS and 0. 05% Tween 20) and rabbit anti-cholera serum 
diluted 1 :8,000 in PBST containing 0.5% BSA was added and incubated for 2h at 37°C. The 
wells were washed and incubated with 1:50,000 mouse anti rabbit IgG-alkaline phosphatase 
conjugate in PBST containing 0.5% BSA for 2h at 37°C. Th e plate was developed with Sigma 
Fast pNPP substrate (Sigma) for 30 minutes at room temperature and the reaction was ended by 
addition of 3N NaOH and plates were read at 405 nm. 

GMi Ganglioside Binding Assay: To determine the affinity of chloroplast derived CTB for 
GMi-gangliosides, microliter plates were coated with monosialogangli oside-GM! (Sigma) (3.0 
Mg/ml in bicarb, buffer) and incubated at 4°C overnight. As a control, BSA 
(3.0 jug/ml in bicarb, buffer) was coated on some wells. The wells were blocked with 1% 
BSA in PBS for 2h at 37 °C, washed thrice with washing buffer, PBST and incubated with 
dilutions of transformed plant protein, untransformed plant prot ein and bacterial CTB in PBS. 
incubation of plates with primary and secondary antibody dilutions and detection was similar to 
the CTB ELISA procedure described above. , % 
pLD-LH-CTB vector construction and E. coli expression: The pLD-LH-CTB vector 
integrates the genes of interest into the inverted repeat regions of the chloroplast genome 
between the trnl and trnA genes. Integration o ccurs through homologous recombination events 
between the trnl and trnA chloroplast border sequences of the vector and the corr esponding 
homologous sequences of the chloroplast genome as shown in Fig. 27A. The chimeric 
aminoglycoside 3' adenyl transferase (aadA) gene that confers resistance to spectinomycin- 
Streptomycin and the CTB gene downstream of it are driven by the constitutive promoter of the 
rRNA operon (Prrn) and transcription is terminated by the psbA3' untranslated region. Since the 
protein synthetic machinery of chloroplasts is similar to that of E. coli (23), CTB expression of 
the pLD-LH-CTB vector in E. coli was tested. Western blot analysis of sonicated E. coli whole 
cell extract showed the presence ••of4^'-ld3a 'GTB monomers, similar to that obtained when , 
/purified commercially available CTB was treated in the same manner as shown in Fig. 28A. v 
Oligomeric expression of CTB was not observed in E. coli, as expected, due to the absence of a 
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leader peptide sequence present in the native CTB gene that directs th e CTB monomer into the 
periplasmic space allowing for concentration an d oligomeric assembly. 

Selection and Regeneration of Transgenicffi^ leaf pieces when placed on, 

selection medium continued jto grow buf ^Vvere bleached. Green shoots emerged from the part of 
the leaf in contact witli the medium. Five rounds of bombardment (5 leaves each) resulted in 68 
independent transformation events. Each such transgenic line was subjected to a second round 
of antibiotic selection. These putative transformants were subjected to PCR analysis to 
distinguish from nuclear transformants and mutants. 

Determination of Chloroplast Integration and Homoplasmy : PCR and Southern 

^ * ■ — — — 

hybridization were used to determine integration of the CTB gene into the chloroplast genome. 

Primers, 3P and 3M, designed to confirm incorporation of the gene cassette into the chloroplast 
genome were used to screen putative transgenics initially. The primer, 3P, landed on the 
chloroplast genome outside of the chloroplast flanking sequence used for homologous . 
recombination as shown in Fig. 27A. The primer, 3M, landed on the aadA gene. No PCR 
product should be obtained if foreign genes are integrated into the nuclear genome or in mutants 
lacking the aadA gene. The presence of the 1 .6kb PCR product in 9 of the 10 putative 
transgenics screened, confirmed the site-specific integration of the gene cassette into the 
chloroplast genome. Database searches showed that no random priming took place as both the 
3P and 3M primers showed no homology with other gene sequences. This is confirmed by the 
. absence of PCR product in untransformed plants (Fig. 27B). Similar strategy has been used 
successfully by us in order to confirm chloroplast integration of foreign genes (13,14,24,25). 
This screening is essential to eliminate mutants and nuclear transformants and saves space and 
labor of maintaining hundreds of transgenic lines. 

Southern blot analysis of three of the PCR positive transgenic lines was done to further 

1 1 1 •■ : — , i. i ., i — ^ » ■ i 

confirm site specific integration and to establish copy number. In the chloroplast genome, Bglll 

sites flank the chloroplast border sequences 5' of 16S rRNA and 3' of the trnA region as shown in 

Fig. 29 A. A 6.17kb fragment from a transformed plant and a 4.47 kb fragment from an 

untransformed plant were obtained when total plant DNA from transformed and untransformed 

plants was digested with Bglll. The blot of the digested products was probed with a 32 P random 

primef-Tabeled 0.81 kb trnl-trhA fragment. The probe hybridized with the control giving a 4.47 

kb fragment as expected, while for the transgenic lines a 6.17 kb fragment was observed, 

indicating that all plastid genomes had the gene cassette inserted between the tml and trnA 

regions. The absence of a 4.47 kb fragment in transgenic lines indicates tha t homoplasmy has 

been achieved, to the detection level of a Southern blot. These results ex plain the high levels of 

& " ~ 

CTB observed in transgenic tobacco plants. Southern blot confirmed plants transferred to pots 
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were seen to have no adverse pleiotropic effects when compared to untransformed plants as 
shown in Fig.4A. Southern blot analysis of Ti plants in Fig. 3C shows that all 4 transgenic lines 
analyzed maintained homoplasmy. 

Immunoblot Analysis of Chloroplast Synthesized CTB: Anti-cholera toxin antibodies did not 
show significant cross-reaction with tobacco plant protein as can be seen in Fig. 28C, lanes 1 & 
2. Boiled and unboiled leaf homogenates were run on 12% SDS PAGE gels. Unboiled 
chloroplast synthesized CTB protein appeared as compact 45 kDa oligomers as shown in Fig. 
28C, lane 4 similar to the unboiled, pentameric bacterial CTB which appeared to have partially 
dissociated into tetramers, trimers and monomers upon storage at 4°C over a period of several 
months from Fig. 28C, Iane7. 

While heat treatment (4 min. boiling) prior to SDS PAGE of pentameric bacterial CTB, 
gave CTB monomers predominantly, with some protein in the dimeric and trimeric form as 
shown in Fig. 28C, lane 6, chloroplast synthesized CTB dissociated into dimers and trimers only, 
when subjected to similar heat treatment as in Fig. 28C, lanes 3 & 5. These results are different 
from the heat induced dissociation of potato plant nucleus synthesized CTB; oligomers into 
monomers (8). A probable reason for this stability could be a more stable conformation of 
chloroplast synthesized CTB which maybe an added advantage in storage and administration of 
edible vaccines. Leaf homogenates from four different transgenic plants showed almost similar 
expression levels of CTB protein (see Fig. 28B). Thi s suggests very little clonal variation of 
CTB expression, as was confinned later by ELISA quantification assays. Consistent expression 
levels of recombinant proteins in plants (as obtained for CTB in this research) may be essential 
for production of ed ible vaccines in plants. 

ELISA Quantification of CTB Expression: Comparison of the absorb ance at 405nm of a 
known amount of bacteria l CTB - antibody complex (linear standard curve) and that of a kno\Vn 
concentration of transformed plant total soluble protein was used to estimate CTB expression ^ 
levels. Optimal dilutions o f total soluble protein from two transgenic lines were loaded in wells 
of the microliter plate. As reported previously (8), it was necessary to optimize the dilutions of 
total soluble protein, as levels of CTB protein detected varied with the concentration of total 
soluble protein, resulting in too high concentrati ons of total soluble protein inhibiting the CTB 
protein from binding to the wells of the plate. Both T 0 lines yielded CTB protein levels ranging 
between 3.5% to 4.1 % of the to tal soluble protein (40 ju,g of chloroplast synthesized CTB protein 
in 1 mg of total soluble protein) as shown in Fig. 31 A. Also, estimation of CTB protein 
expression levels from diff erent stages of leaves - young, mature and old determined that mature 
leaves have the highest levels of CTB protein expression. This is in accordance with the results 
obtained when similar experiments were performed when the Bt Cry2aA2 gene was expressed" 
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without the putative chaperonin genes, but contrary to results with the Bt Cry2aA2 operon, 
which showed high expression levels in older leaves, probably due to the stable crystalline 
structure (13). 

GMi Ganglioside ELISA Binding Assays: Both chlor oplast synthesized and bacterial CTB 
demonstrated a strong affinity for GMI, - gangliosides (see Fig. 3 IB) indicating that chloroplast 
synthesized CTB conserved the antigenic sites necessary for binding of the CTB pentamer to the 
pentasaccharide GMiL The GMi binding ability also . suggests proper folding of CTB molecules 
resulting in the pentameric structure. Since oxidation of cysteine residues in the B subunits is a 
prerequisite for in vivo formation of CTB pentamers (20), proper folding is a further 
confirmation of the ability of chloroplasts to form disulfide bonds. 

High levels of expression of CTB in transgenic tobacco did not affect growth rates,^ 

flowering or seed setting as has been observed in this laboratory, unlike previously reported for 
^ the synthetic LTB gene, constitutively expressed via the nuclear genome (7). Transformed plan t 
seedlings were green in color while untransformed seed lings lacking the aadA gene were ^ 
bleached white as shown in Fig. 4B when germinated on antibiotic medium x 

The potential use of this technology is three-fold. While, it can be used for large scale 
production of purified CTB, it can also be used as an edible vaccine if expressed in an edible 
plant or as a transmucosal carrier of peptides to which it is fused to, so as to either enhance 
mucosal immunity or to induce oral tolerance to the products of these peptides (5). Large-scale 
production of purified CTB in bacteria involves the use of expensive fermentation techniques 
and stringent purification protocols (26) making this a prohibitively expensive technology for 
developing countries. The cost of producing 1kg of recombinant protein in transgenic crops has 
been estimated to be 50 times lower than the cost of producing the same amount by E. coli 
fermentation, assuming that recombinant protein is 20% of total E.coli protein (27). Thus, 
isolation and lysis of CTB producing chloroplasts from chloroplast transformed plants could 
^ serve as a cost-effective means of mass production of purified CTB.' If used as an edible 
vaccine, a selection scheme eliminating the use of antibiotic resistant genes should be developed. 
One such scheme uses the betaine aldehyde dehydogenase (BADH) gene, which converts toxic 
betaine aldehyde to nontoxic glycine betaine, an osmoprotectant (28). Also, several other 
strategies have been proposed to eliminate antibiotic-resistant genes from transgenic plants (29). 

Transgenic potato plants that synthesize a CTB-insulin fusion protein at l evels of up to 
0.1% of the total soluble tuber protein have been found to show a substantial reduction in 
pancreatic islet inflammation and a delay in the progression of clinical diabetes (30). This may 
prove to be an effective clinical approach for prevention of spontaneous autoimmune diabetes. 
Since, increased CTB expression levels have been shown to be achievable via the chloroplast 



82 



1465-PCT-OO (1577-P-00) 

genome thr ough this research, exp ression of a CTB-proinsulin fusion protein in the chloroplast s 
of edible tobacco (LAMD) is currently being tested in our laboratory. While existing expression 
levels of CTB via the chloroplast genome are adequate for commercial exploitation, levels can 
be increased further (about 10 fold) by insertion of a putative chaperonin, .as in the case of the Bt 
Cry2aA2 operon, (13) which likely aids in the subsequent purification of recombinant CTB due 
to crystallization. 
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